Skip to content

Commit 50f80a8

Browse files
committed
Updateing audio image and page for less product name callout
1 parent 937c886 commit 50f80a8

File tree

3 files changed

+16
-18
lines changed

3 files changed

+16
-18
lines changed

articles/ai-services/content-understanding/audio/overview.md

Lines changed: 16 additions & 18 deletions
Original file line numberDiff line numberDiff line change
@@ -20,17 +20,17 @@ ms.custom: release-preview-2-cu
2020
> * Features, approaches, and processes can change or have limited capabilities, before General Availability (GA).
2121
> * For more information, *see* [**Supplemental Terms of Use for Microsoft Azure Previews**](https://azure.microsoft.com/support/legal/preview-supplemental-terms).
2222
23-
Content Understanding audio analyzers enable transcription and diarization of conversational audio, extracting structured fields such as summaries, sentiments, and key topics. Customize an audio analyzer template to your business needs using [Azure AI Foundry portal](https://ai.azure.com/) to start generating results.
23+
Audio analyzers enable transcription and diarization of conversational audio, extracting structured fields such as summaries, sentiments, and key topics. Customize an audio analyzer template to your business needs using [Azure AI Foundry portal](https://ai.azure.com/) to start generating results.
2424

25-
Here are common scenarios for using Content Understanding with conversational audio data:
25+
Here are common scenarios for conversational audio data processing:
2626

2727
* Gain customer insights through summarization and sentiment analysis.
2828
* Assess and verify call quality and compliance in call centers.
2929
* Create automated summaries and metadata for podcast publishing.
3030

3131
## Audio analyzer capabilities
3232

33-
:::image type="content" source="../media/audio/overview/workflow-diagram.png" lightbox="../media/audio/overview/workflow-diagram.png" alt-text="Illustration of Content Understanding audio workflow.":::
33+
:::image type="content" source="../media/audio/overview/workflow-diagram.png" lightbox="../media/audio/overview/workflow-diagram.png" alt-text="Illustration of Content Understanding audio capabilities.":::
3434

3535
Content Understanding serves as a cornerstone for Speech Analytics solutions, enabling the following capabilities for audio files:
3636

@@ -39,23 +39,21 @@ Content Understanding serves as a cornerstone for Speech Analytics solutions, en
3939
* **Transcription**. Converts conversational audio into searchable and analyzable text-based transcripts in WebVTT format. Customizable fields can be generated from transcription data. Sentence-level and word-level timestamps are available upon request.
4040

4141
> [!NOTE]
42-
> Content Understanding supports the full set of [Azure AI Speech Speech to text languages](https://learn.microsoft.com/azure/ai-services/speech-service/language-support?tabs=stt).
42+
> Content Understanding supports the full set of [Azure AI Speech Speech to text languages](../../speech-service/language-support?tabs=stt).
4343
> For languages with Fast transcriptions support and for files ≤ 300 MB and/or ≤ 2 hours, transcription time is reduced substantially.
4444
4545
* **Diarization**. Distinguishes between speakers in a conversation, attributing parts of the transcript to specific speakers.
4646

4747
* **Speaker role detection**. Identifies agent and customer roles within contact center call data.
4848

4949
* **Multilingual transcription**. Generates multilingual transcripts, applying language/locale per phrase. Deviating from language detection this feature is enabled when no language/locale is specified or language is set to 'auto'.
50-
51-
> [!NOTE]
52-
> The following locales are currently supported:
53-
> **Files ≤ 300 MB and/or ≤ 2 hours**: de-DE, en-AU, en-CA, en-GB, en-IN, en-US, es-ES, es-MX, fr-CA, fr-FR, hi-IN, it-IT, ja-JP, ko-KR, and zh-CN.
54-
> **Files larger than 300 MB and/or longer than 4 hours**: en-US, es-ES, es-MX, fr-FR, hi-IN, it-IT, ja-JP, ko-KR, pt-BR, zh-CN.
50+
<br>The following locales are currently supported for multilingual transcription:
51+
* **Files ≤ 300 MB and/or ≤ 2 hours**: de-DE, en-AU, en-CA, en-GB, en-IN, en-US, es-ES, es-MX, fr-CA, fr-FR, hi-IN, it-IT, ja-JP, ko-KR, and zh-CN.
52+
* **Files larger than 300 MB and/or longer than 4 hours**: en-US, es-ES, es-MX, fr-FR, hi-IN, it-IT, ja-JP, ko-KR, pt-BR, zh-CN.
5553

5654
> [!WARNING]
57-
> When Multilingual transcription is used, a file with an unsupported locale will still produce a result. This result will be based on the closest locale but most likely not correct.
58-
> This is a know behavior. Please make sure to configure locales when not using Multilingual transcription!
55+
> When Multilingual transcription is used, a file with an unsupported locale produces a result. This result is based on the closest locale but most likely not correct.
56+
> This result is a known behavior. Make sure to configure locales when not using Multilingual transcription!
5957
6058

6159
* **Language detection**. Automatically detects the dominant language/locale which is used to transcribe the file. Set multiple languages/locales to enable language detection.
@@ -69,7 +67,7 @@ Content Understanding serves as a cornerstone for Speech Analytics solutions, en
6967
Field extraction allows you to extract structured data from audio files, such as summaries, sentiments, and mentioned entities from call logs. You can begin by customizing a suggested analyzer template or creating one from scratch.
7068

7169
## Key Benefits
72-
Content Understanding offers advanced audio capabilities, including:
70+
Advanced audio capabilities, including:
7371

7472
* **Customizable data extraction**. Tailor the output to your specific needs by modifying the field schema, allowing for precise data generation and extraction.
7573

@@ -79,7 +77,7 @@ Content Understanding offers advanced audio capabilities, including:
7977

8078
* **Scenario adaptability**. Adapt the service to your requirements by generating custom fields and extract relevant data.
8179

82-
## Content Understanding prebuild audio analyzers
80+
## Prebuild audio analyzers
8381

8482
The prebuild analyzers allow extracting valuable insights into audio content without the need to create an analyzer setup.
8583

@@ -88,7 +86,7 @@ All audio analyzers generate transcripts in standard WEBVTT format separated by
8886
> [!NOTE]
8987
> Prebuild analyzers are set to use multilingual transcription and returnDetails enabled!
9088
91-
Content Understanding offers the following prebuild analyzers:
89+
The following prebuild analyzers are available:
9290

9391
**Post-call analysis (prebuilt-callCenter)**. Analyze call recordings to generate:
9492
- conversation transcripts with speaker role detection result
@@ -270,9 +268,9 @@ You can also customize prebuild analyzers for more fine-grained control of the o
270268
- and more
271269

272270
## Conversational Knowledge Mining Solution Accelerator
273-
To provide an end-2-end quickstart for Speech Analytics solutions based on Content Understanding, please refer to the [Conversation knowledge mining solution accelerator](https://aka.ms/Conversational-Knowledge-Mining).
271+
For an end-2-end quickstart for Speech Analytics solutions, refer to the [Conversation knowledge mining solution accelerator](https://aka.ms/Conversational-Knowledge-Mining).
274272

275-
Gain actionable insights from large volumes of conversational data by identifying key themes, patterns, and relationships. Using Azure AI Foundry, Azure AI Content Understanding, Azure OpenAI Service, and Azure AI Search, this solution analyzes unstructured dialogue and maps it to meaningful, structured insights.
273+
Gain actionable insights from large volumes of conversational data by identifying key themes, patterns, and relationships. By using Azure AI Foundry, Azure AI Content Understanding, Azure OpenAI Service, and Azure AI Search, this solution analyzes unstructured dialogue and maps it to meaningful, structured insights.
276274

277275
Capabilities such as topic modeling, key phrase extraction, speech-to-text transcription, and interactive chat enable users to explore data naturally and make faster, more informed decisions.
278276

@@ -287,11 +285,11 @@ For a complete list of supported regions, languages, and locales, see our [Langu
287285

288286
## Data privacy and security
289287

290-
Developers using Content Understanding should review Microsoft's policies on customer data. For more information, visit our [Data, protection, and privacy](https://www.microsoft.com/trust-center/privacy) page.
288+
Developers using this service should review Microsoft's policies on customer data. For more information, visit our [Data, protection, and privacy](https://www.microsoft.com/trust-center/privacy) page.
291289

292290
## Next steps
293291

294-
* Try processing your audio content using Content Understanding in [**Azure AI Foundry portal**](https://aka.ms/cu-landing).
292+
* Try processing your audio content in [**Azure AI Foundry portal**](https://aka.ms/cu-landing).
295293
* Learn how to analyze audio content [**analyzer templates**](../quickstart/use-ai-foundry.md).
296294
* Review code sample: [**audio content extraction**](https://github.com/Azure-Samples/azure-ai-content-understanding-python/blob/main/notebooks/content_extraction.ipynb).
297295
* Review code sample: [**analyzer templates**](https://github.com/Azure-Samples/azure-ai-content-understanding-python/tree/main/analyzer_templates).
485 KB
Loading
159 KB
Loading

0 commit comments

Comments
 (0)