Commit 56072fc

Merge pull request #265629 from eric-urban/eur/text-to-speech-updates
custom voice API samples etc
2 parents c721974 + 5e33631 commit 56072fc

3 files changed: 11 additions, 0 deletions

articles/ai-services/speech-service/custom-neural-voice.md

Lines changed: 5 additions & 0 deletions
@@ -48,6 +48,11 @@ Here's an overview of the steps to create a custom neural voice in Speech Studio

 You can tune, adjust, and use your custom voice, similarly as you would use a prebuilt neural voice. Convert text into speech in real-time, or generate audio content offline with text input. You use the [REST API](./rest-text-to-speech.md), the [Speech SDK](./get-started-text-to-speech.md), or the [Speech Studio](https://speech.microsoft.com/audiocontentcreation).

+> [!TIP]
+> You can also use the Speech SDK and custom voice REST API to train a custom neural voice.
+>
+> Check out the code samples in the [Speech SDK repository on GitHub](https://github.com/Azure-Samples/cognitive-services-speech-sdk/blob/master/samples/custom-voice/README.md) to see how to use personal voice in your application.
+
 The style and the characteristics of the trained voice model depend on the style and the quality of the recordings from the voice talent used for training. However, you can make several adjustments by using [SSML (Speech Synthesis Markup Language)](./speech-synthesis-markup.md?tabs=csharp) when you make the API calls to your voice model to generate synthetic speech. SSML is the markup language used to communicate with the text to speech service to convert text into audio. The adjustments you can make include change of pitch, rate, intonation, and pronunciation correction. If the voice model is built with multiple styles, you can also use SSML to switch the styles.

 ## Components sequence
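
Reviewer note on the SSML paragraph in the hunk above: the pitch, rate, and style adjustments it mentions can be sketched roughly as follows. This is a minimal illustration and not part of the commit. The helper `build_ssml`, the voice name `MyCustomVoiceNeural`, and the style `cheerful` are hypothetical, and it assumes the custom voice accepts `prosody` nested inside `mstts:express-as`. The snippet only builds and parses the SSML string; it does not call the Speech service.

```python
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape, quoteattr


def build_ssml(text, voice_name, pitch="+0%", rate="+0%", style=None, lang="en-US"):
    """Build an SSML string that adjusts pitch and rate and optionally picks a style.

    All argument values are illustrative; a real voice name and its supported
    styles come from your own deployment.
    """
    # Wrap the escaped text in a prosody element carrying the adjustments.
    body = f"<prosody pitch={quoteattr(pitch)} rate={quoteattr(rate)}>{escape(text)}</prosody>"
    if style is not None:
        # Style switching only applies if the voice model was built with that style.
        body = f"<mstts:express-as style={quoteattr(style)}>{body}</mstts:express-as>"
    return (
        '<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" '
        'xmlns:mstts="https://www.w3.org/2001/mstts" '
        f"xml:lang={quoteattr(lang)}>"
        f"<voice name={quoteattr(voice_name)}>{body}</voice>"
        "</speak>"
    )


ssml = build_ssml(
    "Hello & welcome.",
    "MyCustomVoiceNeural",  # hypothetical custom voice name
    pitch="+5%",
    rate="-10%",
    style="cheerful",  # hypothetical style
)
ET.fromstring(ssml)  # raises ParseError if the markup is not well-formed XML
print(ssml)
```

The resulting string would then be passed to whichever synthesis entry point you use (REST API or Speech SDK); that call is deliberately left out here.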

articles/ai-services/speech-service/openai-voices.md

Lines changed: 3 additions & 0 deletions
@@ -22,6 +22,9 @@ OpenAI text to speech voices are available via two model variants: `Neural` and
 - `Neural`: Optimized for real-time use cases with the lowest latency, but lower quality than `NeuralHD`.
 - `NeuralHD`: Optimized for quality.

+For a demonstration of OpenAI voices in Azure OpenAI Studio and Speech Studio, view this [introductory video](https://youtu.be/Ic505XeV3gs).
+> [!VIDEO https://www.youtube.com/embed/Ic505XeV3gs]
+
 ## Available text to speech voices in Azure AI services

 You might ask: If I want to use an OpenAI text to speech voice, should I use it via the Azure OpenAI Service or via Azure AI Speech? What are the scenarios that guide me to use one or the other?

articles/ai-services/speech-service/personal-voice-overview.md

Lines changed: 3 additions & 0 deletions
@@ -63,6 +63,9 @@ To get started, here's a summary of the steps to create a personal voice:

 Once you have a personal voice, you can [use it](./personal-voice-how-to-use.md) to synthesize speech in any of the 91 languages supported across 100+ locales. A locale tag isn't required. Personal voice uses automatic language detection at the sentence level. For more information, see [use personal voice in your application](./personal-voice-how-to-use.md).

+> [!TIP]
+> Check out the code samples in the [Speech SDK repository on GitHub](https://github.com/Azure-Samples/cognitive-services-speech-sdk/blob/master/samples/custom-voice/README.md) to see how to use personal voice in your application.
+
 ## Reference documentation

 > [!div class="nextstepaction"]
