articles/ai-services/content-understanding/audio/overview.md
11 additions & 11 deletions
@@ -2,13 +2,13 @@
 title: Azure AI Content Understanding audio overview
 titleSuffix: Azure AI services
 description: Learn about Azure AI Content Understanding audio solutions
-author: laujan
+author: jagoerge
 ms.author: lajanuar
 manager: nitinme
 ms.service: azure-ai-content-understanding
 ms.topic: overview
-ms.date: 03/18/2025
-ms.custom: ignite-2024-understanding-release
+ms.date: 05/06/2025
+ms.custom: release-preview-2-cu
 ---
 
 
@@ -40,13 +40,13 @@ Content Understanding serves as a cornerstone for Media Asset Management solutio
 
 > [!NOTE]
 > Content Understanding supports the full set of [Azure AI Speech Speech to text languages](https://learn.microsoft.com/azure/ai-services/speech-service/language-support?tabs=stt).
-> For languages with Fast transcriptions support and for files ≤ 300 MB and/or ≤ 2 hours, transcription time will be reduced substantially.
+> For languages with Fast transcriptions support and for files ≤ 300 MB and/or ≤ 2 hours, transcription time is reduced substantially.
 
-***Diarization**. Distinguishes between speakers in a conversation, attributing parts of the transcript to specific speakers.
+***Dualization**. Distinguishes between speakers in a conversation, attributing parts of the transcript to specific speakers.
 
 ***Speaker role detection**. Identifies agent and customer roles within contact center call data.
 
-***Multilingual transcription**. Generates multilingual transcripts, applying language/locale per phrase. Deviating from language detection this is enabled when no language/locale is specified or language is set to 'auto'.
+***Multilingual transcription**. Generates multilingual transcripts, applying language/locale per phrase. Deviating from language detection this feature is enabled when no language/locale is specified or language is set to 'auto'.
 
 > [!NOTE]
 > The following locales are currently supported:
@@ -56,8 +56,8 @@ Content Understanding serves as a cornerstone for Media Asset Management solutio
 ***Language detection**. Automatically detects the dominant language/locale which is used to transcribe the file. Set multiple languages/locales to enable language detection.
 
 > [!NOTE]
-> For files larger than 300 MB and/or longer than 2 hours and locales unsupported by Fast transcription, the file will be processed generating a multilingual transcript based on the specified locales.
-> In case language detection fails, the first language/locale defined will be used to transcribe the file.
+> For files larger than 300 MB and/or longer than 2 hours and locales unsupported by Fast transcription, the file is processed generating a multilingual transcript based on the specified locales.
+> In case language detection fails, the first language/locale defined is used to transcribe the file.
 
 ### Field extraction
 
@@ -81,15 +81,15 @@ The prebuild analyzers allow extracting valuable insights into audio content wit
 All audio analyzers generate transcripts in standard WEBVTT format separated by speaker.
 
 > [!NOTE]
-> Prebuild analyzers are set to use multilingual transcription and have returnDetails enabled!
+> Prebuild analyzers are set to use multilingual transcription and returnDetails enabled!
 
 Content Understanding offers the following prebuild analyzers:
 
 **Post-call analysis (prebuilt-callCenter)**. Analyze call recordings to generate:
 - conversation transcripts with speaker role detection result
 - call summary
 - call sentiment
-- top 5 topics mentioned
+- top five articles mentioned
 - list of companies mentioned
 - list of people (name and title/role) mentioned
 - list of relevant call categories
@@ -259,7 +259,7 @@ Content Understanding offers the following prebuild analyzers:
 ```
 
 You can also customize prebuild analyzers for more fine-grained control of the output by defining custom fields. Customization allows you to use the full power of generative models to extract deep insights from the audio. For example, customization allows you to:
-- Generate additional insights
+- Generate other insights
 - Control the language of the field extraction output