Commit 22e0ba9

kbrowne8 and eric-urban authored
Update articles/ai-studio/quickstarts/multimodal-vision.md
Co-authored-by: Eric Urban <[email protected]>
1 parent b3a74e9 commit 22e0ba9

1 file changed

Lines changed: 5 additions & 5 deletions

articles/ai-studio/quickstarts/multimodal-vision.md

@@ -152,11 +152,11 @@ In this chat session, you'll be instructing the assistant to aid in understandin
 
 Below are the known limitations of the video prompt enhancements.
 
-1. **Low Resolution:** The frames are analyzed using GPT-4 Turbo with Vision's "low resolution" setting, which may affect the accuracy of small object and text recognition in the video.
-2. **Video File Limits:** Both mp4 and mov file types are supported. In the Azure AI Playground, videos must be less than 3 minutes long. When using the API there is no such limitation.
-3. **Prompt Limits:** Video prompts only contain one video and no images. In Playground, you can clear the session to try with another video or images.
-4. **Limited Frame Selection:** Currently the system selects 20 frames from the entire video, which might not capture all critical moments or details. Frame selection can either be approximately evenly spread through the video or focused by a specific a Video Retrieval query, depending on the prompt.
-5. **Language Support:** Currently, the system primarily supports English for grounding with transcripts. Transcripts don't provide accurate information on lyrics from songs.
+- **Low resolution:** The frames are analyzed using GPT-4 Turbo with Vision's "low resolution" setting, which may affect the accuracy of small object and text recognition in the video.
+- **Video file limits:** Both mp4 and mov file types are supported. In the Azure AI Playground, videos must be less than 3 minutes long. When using the API there is no such limitation.
+- **Prompt limits:** Video prompts only contain one video and no images. In Playground, you can clear the session to try with another video or images.
+- **Limited frame selection:** Currently the system selects 20 frames from the entire video, which might not capture all critical moments or details. Frame selection can either be approximately evenly spread through the video or focused by a specific a Video Retrieval query, depending on the prompt.
+- **Language support:** Currently, the system primarily supports English for grounding with transcripts. Transcripts don't provide accurate information on lyrics from songs.
 
 ---
 
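The "Limited frame selection" item in the diff describes picking 20 frames spread approximately evenly through a video. The sketch below illustrates what such sampling could look like. It is not the service's actual selection logic, which the article does not document: the function name `evenly_spaced_frame_indices` is hypothetical, and sampling at the midpoint of each segment is an assumption made for illustration.

```python
def evenly_spaced_frame_indices(total_frames: int, num_samples: int = 20) -> list[int]:
    """Return up to `num_samples` frame indices spread evenly across a video.

    Hypothetical helper: the default of 20 comes from the article text;
    the midpoint-of-segment strategy is an illustrative assumption.
    """
    if total_frames <= 0 or num_samples <= 0:
        return []
    # Short clips: there are not enough frames to sample, so keep them all.
    if total_frames <= num_samples:
        return list(range(total_frames))
    # Split the video into equal segments and take the middle frame of each.
    step = total_frames / num_samples
    return [int(step * i + step / 2) for i in range(num_samples)]


# Example: a 3-minute clip at an assumed 30 frames per second.
indices = evenly_spaced_frame_indices(total_frames=3 * 60 * 30)
print(len(indices), indices[:3], indices[-1])
```

Under these assumptions, consecutive samples in a 3-minute clip are 270 frames (about 9 seconds at 30 fps) apart, which suggests why brief events in the video might not be captured.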