Commit 88885a4

authored
Update overview.md
1 parent 5b00530 commit 88885a4

File tree

1 file changed

+9
-5
lines changed
  • articles/ai-services/content-understanding/audio

articles/ai-services/content-understanding/audio/overview.md

Lines changed: 9 additions & 5 deletions
@@ -37,9 +37,10 @@ Content Understanding serves as a cornerstone for Media Asset Management solutio
 
 * **Transcription**. Converts conversational audio into searchable and analyzable text-based transcripts in WebVTT format. Customizable fields can be generated from transcription data. Sentence-level and word-level timestamps are available upon request.
 
-> [!NOTE]r
+> [!NOTE]
+>
 > Content Understanding supports the full set of [Azure AI Speech Speech to text languages](../../speech-service/language-support.md).
-> For languages with Fast transcriptions support and for files ≤ 300 MB and/or ≤ 2 hours, transcription time is reduced substantially.
+> For languages with fast transcriptions support and for files ≤ 300 MB and/or ≤ 2 hours, transcription time is reduced substantially.
 
 * **Diarization**. Distinguishes between speakers in a conversation, attributing parts of the transcript to specific speakers.
 
@@ -48,13 +49,15 @@ Content Understanding serves as a cornerstone for Media Asset Management solutio
 * **Multilingual transcription**. Generates multilingual transcripts, applying language/locale per phrase. Deviating from language detection this feature is enabled when no language/locale is specified or language is set to `auto`.
 
 > [!NOTE]
+>
 > The following locales are currently supported:
-> **Files ≤ 300 MB and/or ≤ 2 hours**: de-DE, en-AU, en-CA, en-GB, en-IN, en-US, es-ES, es-MX, fr-CA, fr-FR, hi-IN, it-IT, ja-JP, ko-KR, and zh-CN.
-> **Files larger than 300 MB and/or longer than 4 hours**: en-US, es-ES, es-MX, fr-FR, hi-IN, it-IT, ja-JP, ko-KR, pt-BR, zh-CN.
+> * **Files ≤ 300 MB and/or ≤ 2 hours**: de-DE, en-AU, en-CA, en-GB, en-IN, en-US, es-ES, es-MX, fr-CA, fr-FR, hi-IN, it-IT, ja-JP, ko-KR, and zh-CN.
+> * **Files larger than 300 MB and/or longer than 4 hours**: en-US, es-ES, es-MX, fr-FR, hi-IN, it-IT, ja-JP, ko-KR, pt-BR, zh-CN.
 
 * **Language detection**. Automatically detects the dominant language/locale which is used to transcribe the file. Set multiple languages/locales to enable language detection.
 
 > [!NOTE]
+>
 > For files larger than 300 MB and/or longer than 2 hours and locales unsupported by Fast transcription, the file is processed generating a multilingual transcript based on the specified locales.
 > In case language detection fails, the first language/locale defined is used to transcribe the file.
 
@@ -80,7 +83,8 @@ The prebuilt analyzers allow extracting valuable insights into audio content wit
 All audio analyzers generate transcripts in standard WEBVTT format separated by speaker.
 
 > [!NOTE]
-> Prebuilt analyzers are set to use multilingual transcription and returnDetails enabled!
+>
+> Prebuilt analyzers are set to use multilingual transcription and `returnDetails` enabled.
 
 Content Understanding offers the following prebuilt analyzers:
 
