Skip to content

Commit 6baa222

Browse files
authored
Update overview.md
1 parent 9b3ffeb commit 6baa222

File tree

1 file changed

+5
-16
lines changed
  • articles/ai-services/content-understanding/video

1 file changed

+5
-16
lines changed

articles/ai-services/content-understanding/video/overview.md

Lines changed: 5 additions & 16 deletions
Original file line numberDiff line numberDiff line change
@@ -55,12 +55,10 @@ Calling prebuilt-video with no custom schema returns a document like the followi
5555
# Video: 00:00.000 → 00:30.000
5656
Width: 1280 · Height: 720
5757

58-
## Segment 1 00:00.000 → 00:06.400
59-
A lively gathering in a room decorated with colorful banners and balloons. Party guests watch a TV showing a sports event while a young man kneels excitedly in front. Snacks and drinks underline the festive mood.
60-
61-
**Transcript**
58+
Transcript
6259
WEBVTT
63-
00:03.600 → 00:06.000 <1 Speaker> Get New Years ready.
60+
00:03.600 --> 00:06.000 <1 Speaker> Get new years ready.
61+
00:11.120 --> 00:13.520 <1 Speaker>Find your style for the new year
6462

6563
**Key frames**
6664
- 00:00.600 ![KF](keyFrame.600.jpg)
@@ -71,16 +69,7 @@ Calling prebuilt-video with no custom schema returns a document like the followi
7169
- 00:05.600 ![KF](keyFrame.5600.jpg)
7270
- 00:06.200 ![KF](keyFrame.6200.jpg)
7371

74-
## Segment 2 00:06.400 → 00:10.080
75-
The room erupts into a vibrant party scene—people dancing under soccer-themed décor, flags waving, energy soaring.
76-
77-
**Key frames**
78-
- 00:07.080 ![KF](keyFrame.7080.jpg)
79-
- 00:07.760 ![KF](keyFrame.7760.jpg)
80-
- 00:08.560 ![KF](keyFrame.8560.jpg)
81-
- 00:09.360 ![KF](keyFrame.9360.jpg)
82-
83-
*…additional segments omitted for brevity…*
72+
*…additional data omitted for brevity…*
8473
````
8574

8675
## Walk-through
@@ -113,8 +102,8 @@ The first pass is all about extracting a first set of details—who's speaking,
113102
> When Multilingual transcription is used, any files with unsupported locales produce a result based on the closest supported locale, which is likely incorrect. This result is a known
114103
> behavior. Avoid transcription quality issues by ensuring that you configure locales when not using a multilingual transcription supported locale!
115104

116-
* **Shot detection:** Identifies segments of the video aligned with shot boundaries where possible, allowing for precise editing and repackaging of content with breaks exactly on shot boundaries.
117105
* **Key frame extraction:** Extracts key frames from videos to represent each shot completely, ensuring each shot has enough key frames to enable field extraction to work effectively.
106+
* **Shot detection:** Identifies segments of the video aligned with shot boundaries where possible, allowing for precise editing and repackaging of content with breaks exactly on shot boundaries. This is output as a
118107

119108
## Field extraction and segmentation
120109

0 commit comments

Comments
 (0)