Skip to content

Commit 116b5af

Browse files
Merge pull request #6374 from eric-urban/eur/acc-in-foundry
use audio content creation in ai foundry
2 parents cca72c5 + 073dbb5 commit 116b5af

File tree

8 files changed

+330
-210
lines changed

8 files changed

+330
-210
lines changed

articles/ai-services/connect-services-ai-foundry-portal.md

Lines changed: 1 addition & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -73,10 +73,6 @@ In the project-level playgrounds, you can try Azure AI services such as Azure AI
7373

7474
:::image type="content" source="./media/ai-foundry/azure-ai-services-playgrounds.png" alt-text="Screenshot of the project level playgrounds that you can use." lightbox="./media/ai-foundry/azure-ai-services-playgrounds.png":::
7575

76-
1. Optionally, you can select a different connection to use in the playground.
77-
78-
:::image type="content" source="./media/ai-foundry/speech-playground.png" alt-text="Screenshot of the Speech playground in a project." lightbox="./media/ai-foundry/speech-playground.png":::
79-
8076
## Fine-tune Azure AI services models
8177

8278
In Azure AI Foundry portal, you can fine-tune some Azure AI services models. For example, you can fine-tune a model for custom speech.
@@ -102,7 +98,7 @@ Once you have a project, several Azure AI services models are already deployed a
10298

10399
In this example, we see six Azure AI Services deployments (such as Azure AI Speech and Azure AI Language) via the default connection. These models were already available for use when you created the project.
104100

105-
There's no option to deploy Azure AI services models from the **Models + endpoints** page. Azure AI services models are already deployed and ready to use.
101+
You don't deploy Azure AI services models from the **Models + endpoints** page. Azure AI services models are already deployed and ready to use.
106102

107103
However, you can deploy [fine-tuned Azure AI services models](#fine-tune-azure-ai-services-models). For example, you might want to deploy a custom speech model that you fine-tuned. In this case, you can deploy the model from the corresponding fine-tuning page.
108104

articles/ai-services/includes/quickstarts/ai-foundry-prerequisites.md

Lines changed: 0 additions & 14 deletions
This file was deleted.
Binary file not shown.

articles/ai-services/speech-service/get-started-speech-to-text.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,7 @@ author: eric-urban
66
manager: nitinme
77
ms.service: azure-ai-speech
88
ms.topic: quickstart
9-
ms.date: 7/16/2025
9+
ms.date: 8/5/2025
1010
ms.author: eur
1111
ms.reviewer: eur
1212
ms.custom: devx-track-python, devx-track-js, devx-track-csharp, mode-other, devx-track-extended-java, devx-track-go, build-2024, ignite-2024

articles/ai-services/speech-service/how-to-audio-content-creation.md

Lines changed: 19 additions & 184 deletions
Large diffs are not rendered by default.
Lines changed: 121 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,121 @@
1+
---
2+
author: eric-urban
3+
ms.author: eur
4+
ms.service: azure-ai-speech
5+
ms.topic: include
6+
ms.date: 8/5/2025
7+
---
8+
9+
## Prerequisites
10+
11+
- An active Azure subscription. [Create one for free](https://azure.microsoft.com/free/ai-services).
12+
- Permission to create resources in your subscription.
13+
- An Azure AI Foundry project. For more information, see [Create an Azure AI Foundry project](/azure/ai-foundry/how-to/create-projects).
14+
15+
## Use the audio content creation tool
16+
17+
The following diagram displays the process for fine-tuning the text to speech outputs.
18+
19+
:::image type="content" source="../../../media/audio-content-creation/audio-content-creation-diagram.jpg" alt-text="Diagram of the sequence of steps for fine-tuning text to speech outputs." lightbox="../../../media/audio-content-creation/audio-content-creation-diagram.jpg":::
20+
21+
### Access the tool
22+
23+
To access the audio content creation tool in Azure AI Foundry, follow these steps:
24+
25+
1. Go to your project in [Azure AI Foundry](https://ai.azure.com/?cid=learnDocs).
26+
1. Select **Playgrounds** from the left pane.
27+
1. In the **Speech playground** tile, select **Try the Speech playground**.
28+
1. Select **Text to speech** > **Audio content creation**. You might need to scroll to find the tile.
29+
30+
:::image type="content" source="../../../media/voice-live/foundry-portal/capabilities-by-scenario.png" alt-text="Screenshot of filtering Speech service capabilities by scenario." lightbox="../../../media/voice-live/foundry-portal/capabilities-by-scenario.png":::
31+
32+
### Workflow overview
33+
34+
Once you have access to the tool, follow this general workflow:
35+
36+
1. [Create an audio tuning file](#create-an-audio-tuning-file) by using plain text or SSML scripts. Enter or upload your content into audio content creation.
37+
1. Choose the voice and the language for your script content. Audio content creation includes all of the [standard text to speech voices](../../../language-support.md?tabs=tts). You can use standard voices or a custom voice.
38+
39+
> [!NOTE]
40+
> Custom voice access is [limited](/azure/ai-foundry/responsible-ai/speech-service/text-to-speech/limited-access) based on eligibility and usage criteria. Request access on the [intake form](https://aka.ms/customneural).
41+
42+
1. Select the content you want to preview, and then select **Play** (via the triangle icon) to preview the default synthesis output.
43+
44+
If you make any changes to the text, select the **Stop** icon, and then select **Play** again to regenerate the audio with changed scripts.
45+
46+
Improve the output by adjusting pronunciation, break, pitch, rate, intonation, voice style, and more. For a complete list of options, see [Speech Synthesis Markup Language](../../../speech-synthesis-markup.md).
47+
48+
1. Save and [export your tuned audio](#export-tuned-audio).
49+
50+
When you save the tuning track in the system, you can continue to work and iterate on the output. When you're satisfied with the output, you can create an audio creation task with the export feature. You can observe the status of the export task and download the output for use with your apps and products.
51+
52+
## Create an audio tuning file
53+
54+
You can get your content into the audio content creation tool in either of two ways:
55+
56+
### Option 1: Create a new audio tuning file
57+
58+
1. Select **New** > **Text file** to create a new audio tuning file.
59+
60+
1. Enter or paste your content into the editing window. The allowable number of characters for each file is 20,000 or fewer. If your script contains more than 20,000 characters, you can use Option 2 to automatically split your content into multiple files.
61+
62+
1. Select **Save**.
63+
64+
### Option 2: Upload an audio tuning file
65+
66+
1. Select **Upload** > **Text file** to import one or more text files. Both plain text and SSML are supported.
67+
68+
If your script file is more than 20,000 characters, split the content by paragraphs, by characters, or by regular expressions.
69+
70+
1. When you upload your text files, make sure that they meet these requirements:
71+
72+
| Property | Description |
73+
|----------|---------------|
74+
| File format | Plain text (.txt) or SSML text (.txt)<br/><br/>Zip files aren't supported. |
75+
| Encoding format | UTF-8 |
76+
| File name | Each file must have a unique name. Duplicate files aren't supported. |
77+
| Text length | Character limit is 20,000. If your files exceed the limit, split them according to the instructions in the tool. |
78+
| SSML restrictions | Each SSML file can contain only a single piece of SSML. |
79+
80+
81+
Here's a plain text example:
82+
83+
```txt
84+
Welcome to use audio content creation to customize audio output for your products.
85+
```
86+
87+
Here's an SSML example:
88+
89+
```xml
90+
<speak xmlns="http://www.w3.org/2001/10/synthesis" xmlns:mstts="http://www.w3.org/2001/mstts" version="1.0" xml:lang="en-US">
91+
<voice name="en-US-AvaMultilingualNeural">
92+
Welcome to use audio content creation <break time="10ms" />to customize audio output for your products.
93+
</voice>
94+
</speak>
95+
```
96+
97+
## Export tuned audio
98+
99+
After you review your audio output and are satisfied with your tuning and adjustment, you can export the audio.
100+
101+
1. Select **Export** to create an audio creation task.
102+
103+
We recommend **Export to Audio library** to easily store, find, and search audio output in the cloud. You can better integrate with your applications through Azure blob storage. You can also download the audio to your local disk directly.
104+
105+
1. Choose the output format for your tuned audio. The **supported audio formats and sample rates** are listed in the following table:
106+
107+
| Format | 8 kHz sample rate | 16 kHz sample rate | 24 kHz sample rate | 48 kHz sample rate |
108+
|--- |--- |--- |--- |--- |
109+
| wav | riff-8khz-16bit-mono-pcm | riff-16khz-16bit-mono-pcm | riff-24khz-16bit-mono-pcm |riff-48khz-16bit-mono-pcm |
110+
| mp3 | N/A | audio-16khz-128kbitrate-mono-mp3 | audio-24khz-160kbitrate-mono-mp3 |audio-48khz-192kbitrate-mono-mp3 |
111+
112+
1. To view the status of the task, select the **Task list** tab.
113+
114+
If the task fails, see the detailed information page for a full report.
115+
116+
1. When the task is complete, your audio is available for download on the **Audio library** pane.
117+
118+
1. Select the file you want to download and **Download**.
119+
120+
Now you're ready to use your custom tuned audio in your apps or products.
121+

0 commit comments

Comments
 (0)