You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/ai-services/content-understanding/audio/overview.md
+9-5Lines changed: 9 additions & 5 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -37,9 +37,10 @@ Content Understanding serves as a cornerstone for Media Asset Management solutio
37
37
38
38
***Transcription**. Converts conversational audio into searchable and analyzable text-based transcripts in WebVTT format. Customizable fields can be generated from transcription data. Sentence-level and word-level timestamps are available upon request.
39
39
40
-
> [!NOTE]r
40
+
> [!NOTE]
41
+
>
41
42
> Content Understanding supports the full set of [Azure AI Speech Speech to text languages](../../speech-service/language-support.md).
42
-
> For languages with Fast transcriptions support and for files ≤ 300 MB and/or ≤ 2 hours, transcription time is reduced substantially.
43
+
> For languages with fast transcriptions support and for files ≤ 300 MB and/or ≤ 2 hours, transcription time is reduced substantially.
43
44
44
45
***Diarization**. Distinguishes between speakers in a conversation, attributing parts of the transcript to specific speakers.
45
46
@@ -48,13 +49,15 @@ Content Understanding serves as a cornerstone for Media Asset Management solutio
48
49
***Multilingual transcription**. Generates multilingual transcripts, applying language/locale per phrase. Deviating from language detection this feature is enabled when no language/locale is specified or language is set to `auto`.
***Language detection**. Automatically detects the dominant language/locale which is used to transcribe the file. Set multiple languages/locales to enable language detection.
56
58
57
59
> [!NOTE]
60
+
>
58
61
> For files larger than 300 MB and/or longer than 2 hours and locales unsupported by Fast transcription, the file is processed generating a multilingual transcript based on the specified locales.
59
62
> In case language detection fails, the first language/locale defined is used to transcribe the file.
60
63
@@ -80,7 +83,8 @@ The prebuilt analyzers allow extracting valuable insights into audio content wit
80
83
All audio analyzers generate transcripts in standard WEBVTT format separated by speaker.
81
84
82
85
> [!NOTE]
83
-
> Prebuilt analyzers are set to use multilingual transcription and returnDetails enabled!
86
+
>
87
+
> Prebuilt analyzers are set to use multilingual transcription and `returnDetails` enabled.
84
88
85
89
Content Understanding offers the following prebuilt analyzers:
0 commit comments