Commit 78a82e3

custom voice terminology update
1 parent 007ac0a commit 78a82e3

41 files changed, with 207 additions and 214 deletions.
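
The hunks in this commit follow one recurring pattern: the display text "Custom neural voice" becomes "Custom voice", while link targets such as `custom-neural-voice.md` stay as they are. A minimal sketch of that substitution follows. It is not the tooling used for this commit; the name `rename_term` is hypothetical, and the sketch does not cover the other wording changes in the diff (for example "training data" to "fine-tuning data", or the `#custom-neural-voice` anchor that one hunk changes by hand).

```python
import re

# Matches the display phrase only. Hyphenated names such as
# custom-neural-voice.md contain no spaces, so they are left alone.
_OLD_TERM = re.compile(r"\b([Cc])ustom neural voice\b")


def rename_term(line: str) -> str:
    """Replace 'custom neural voice' with 'custom voice', keeping the first letter's case."""
    return _OLD_TERM.sub(lambda m: m.group(1) + "ustom voice", line)


if __name__ == "__main__":
    before = "- [Custom neural voice](custom-neural-voice.md) (Custom models for Speech synthesizing)"
    print(rename_term(before))
```

Matching on the space-separated phrase rather than on the individual words is what keeps file names and URL fragments untouched, so any anchor rename has to be a separate, deliberate step.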

articles/ai-services/speech-service/bring-your-own-storage-speech-resource.md

Lines changed: 3 additions & 4 deletions
@@ -26,13 +26,12 @@ BYOS can be used with several Azure AI services. For Speech, it can be used in t
 
 - [Batch transcription](batch-transcription.md)
 - Real-time transcription with [audio and transcription result logging](logging-audio-transcription.md) enabled
-- [Custom speech](custom-speech-overview.md) (Custom models for Speech recognition)
+- [Custom speech](custom-speech-overview.md) - Fine-tuning of speech to text models with custom data.
 
 **Text to speech**
 
 - [Audio Content Creation](how-to-audio-content-creation.md)
-- [Custom neural voice](custom-neural-voice.md) (Custom models for Speech synthesizing)
-
+- [Custom voice](custom-neural-voice.md) - Fine-tuning of text to speech models with custom data.
 
 One Speech resource – Storage account combination can be used for all four scenarios simultaneously in all combinations.

@@ -436,7 +435,7 @@ For more information, see [Prevent anonymous public read access to containers an
 
 **Configure Azure Storage firewall**
 
-Custom neural voice uses [User delegation SAS](/azure/storage/common/storage-sas-overview#user-delegation-sas) to read the data for professional voice fine-tuning. It requires allowing external network traffic access to the Storage account.
+Custom voice uses [User delegation SAS](/azure/storage/common/storage-sas-overview#user-delegation-sas) to read the data for professional voice fine-tuning. It requires allowing external network traffic access to the Storage account.
 
 1. Go to the [Azure portal](https://portal.azure.com/) and sign in to your Azure account.
 1. Select the Storage account.

articles/ai-services/speech-service/call-center-overview.md

Lines changed: 1 addition & 1 deletion
@@ -47,7 +47,7 @@ You might want to further customize and fine-tune the experience for your produc
 | Speech customization | Description |
 | -------------- | ----------- |
 | [Custom speech](./custom-speech-overview.md) | A speech to text feature used to evaluate and improve the speech recognition accuracy of use-case specific entities (such as alpha-numeric customer, case, and contract IDs, license plates, and names). You can also train a custom model with your own product names and industry terminology. |
-| [Custom neural voice](./custom-neural-voice.md) | A text to speech feature that lets you create a one-of-a-kind, customized, synthetic voice for your applications. |
+| [Custom voice](./custom-neural-voice.md) | A text to speech feature that lets you create a one-of-a-kind, customized, synthetic voice for your applications. |
 
 ### Language service

articles/ai-services/speech-service/custom-neural-voice.md

Lines changed: 4 additions & 4 deletions
@@ -12,7 +12,7 @@ ms.author: eur
 
 # What is custom voice?
 
-Custom voice is a text to speech feature that lets you create a one-of-a-kind, customized, synthetic voice for your applications. With custom voice, you can build a highly natural-sounding voice for your brand or characters by providing human speech samples as training data.
+Custom voice is a text to speech feature that lets you create a one-of-a-kind, customized, synthetic voice for your applications. With custom voice, you can build a highly natural-sounding voice for your brand or characters by providing human speech samples as fine-tuning data.
 
 > [!IMPORTANT]
 > Custom voice access is [limited](/legal/cognitive-services/speech-service/custom-neural-voice/limited-access-custom-neural-voice?context=%2fazure%2fcognitive-services%2fspeech-service%2fcontext%2fcontext) based on eligibility and usage criteria. Request access on the [intake form](https://aka.ms/customneural).
@@ -35,8 +35,8 @@ Before you get started in Speech Studio, here are some considerations:
 Here's an overview of the steps to create a custom voice in Speech Studio:
 
 1. [Create a project](professional-voice-create-project.md) to contain your data, voice models, tests, and endpoints. Each project is specific to a country/region and language. If you're going to create multiple voices, it's recommended that you create a project for each voice.
-1. [Set up voice talent](professional-voice-create-project.md). Before you can train a neural voice, you must submit a recording of the voice talent's consent statement. The voice talent statement is a recording of the voice talent reading a statement that they consent to the usage of their speech data to train a custom voice model.
-1. [Prepare training data](professional-voice-create-training-set.md) in the right [format](how-to-custom-voice-training-data.md). It's a good idea to capture the audio recordings in a professional quality recording studio to achieve a high signal-to-noise ratio. The quality of the voice model depends heavily on your training data. Consistent volume, speaking rate, pitch, and consistency in expressive mannerisms of speech are required.
+1. [Set up voice talent](professional-voice-create-project.md). Before you can fine-tune a professional voice, you must submit a recording of the voice talent's consent statement. The voice talent statement is a recording of the voice talent reading a statement that they consent to the usage of their speech data for professional voice fine-tuning.
+1. [Prepare fine-tuning data](professional-voice-create-training-set.md) in the right [format](how-to-custom-voice-training-data.md). It's a good idea to capture the audio recordings in a professional quality recording studio to achieve a high signal-to-noise ratio. The quality of the voice model depends heavily on your fine-tuning data. Consistent volume, speaking rate, pitch, and consistency in expressive mannerisms of speech are required.
 1. [Train your voice model](professional-voice-train-voice.md). Select at least 300 utterances to create a custom voice. A series of data quality checks are automatically performed when you upload them. To build high-quality voice models, you should fix any errors and submit again.
 1. [Test your voice](professional-voice-train-voice.md#test-your-voice-model). Prepare test scripts for your voice model that cover the different use cases for your apps. It’s a good idea to use scripts within and outside the training dataset, so you can test the quality more broadly for different content.
 1. [Deploy and use your voice model](professional-voice-deploy-endpoint.md) in your apps.
@@ -77,5 +77,5 @@ An AI system includes not only the technology, but also the people who use it, t
 ## Next steps
 
 * [Create a project](professional-voice-create-project.md)
-* [Prepare training data](professional-voice-create-training-set.md)
+* [Prepare fine-tuning data](professional-voice-create-training-set.md)
 * [Train model](professional-voice-train-voice.md)
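
The "Train your voice model" step in this file asks for at least 300 utterances. A local pre-check before upload could look roughly like the sketch below. It assumes a transcript with one utterance per line in the form `<id><TAB><text>`; that layout, and the names `count_utterances` and `has_enough_utterances`, are illustrative assumptions. The sketch does not reproduce the data quality checks the service runs on upload.

```python
MIN_UTTERANCES = 300  # minimum stated in the "Train your voice model" step


def count_utterances(script_text: str) -> int:
    """Count lines of the assumed form <id><TAB><text> with a non-empty id and text."""
    count = 0
    for line in script_text.splitlines():
        utterance_id, sep, text = line.partition("\t")
        # Skip blank lines and lines without a tab-separated transcript.
        if sep and utterance_id.strip() and text.strip():
            count += 1
    return count


def has_enough_utterances(script_text: str, minimum: int = MIN_UTTERANCES) -> bool:
    """Return True when the transcript holds at least `minimum` usable utterances."""
    return count_utterances(script_text) >= minimum


if __name__ == "__main__":
    sample = "\n".join(f"{i:04d}\tSample sentence number {i}." for i in range(1, 301))
    print(count_utterances(sample), has_enough_utterances(sample))
```

Counting only lines that have both an id and text keeps blank or malformed lines from inflating the total, which matters when a transcript is close to the minimum.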

articles/ai-services/speech-service/gaming-concepts.md

Lines changed: 2 additions & 2 deletions
@@ -20,7 +20,7 @@ Here are a few Speech features to consider for flexible and interactive game exp
 - Make the game more accessible for players who are unable to read text in a particular language, including young players who don't read or write. Players can listen to storylines and instructions in their preferred language.
 - Create game avatars and nonplayable characters (NPC) that can initiate or participate in a conversation in-game.
 - Standard voice can provide highly natural out-of-box voices with leading voice variety in terms of a large portfolio of languages and voices.
-- Custom neural voice for creating a voice that stays on-brand with consistent quality and speaking style. You can add emotions, accents, nuances, laughter, and other para linguistic sounds and expressions.
+- Custom voice for creating a voice that stays on-brand with consistent quality and speaking style. You can add emotions, accents, nuances, laughter, and other para linguistic sounds and expressions.
 - Use game dialogue prototyping to shorten the amount of time and money spent in product to get the game to market sooner. You can rapidly swap lines of dialog and listen to variations in real-time to iterate the game content.
 
 You can use the [Speech SDK](speech-sdk.md) or [Speech CLI](spx-overview.md) for real-time low latency speech to text, text to speech, language identification, and speech translation. You can also use the [Batch transcription API](batch-transcription.md) to transcribe prerecorded speech to text. To synthesize a large volume of text input (long and short) to speech, use the [Batch synthesis API](batch-synthesis.md).
@@ -29,7 +29,7 @@ For information about locale and regional availability, see [Language and voice
 
 ## Text to speech
 
-Help bring everyone into the conversation by converting text messages to audio using [Text to speech](text-to-speech.md) for scenarios, such as game dialogue prototyping, greater accessibility, or nonplayable character (NPC) voices. Text to speech includes [standard voice](language-support.md?tabs=tts#standard-voices) and [custom voice](language-support.md?tabs=tts#custom-neural-voice) features. Standard voice can provide highly natural out-of-box voices with leading voice variety in terms of a large portfolio of languages and voices. Custom neural voice is an easy-to-use self-service for creating a highly natural custom voice.
+Help bring everyone into the conversation by converting text messages to audio using [Text to speech](text-to-speech.md) for scenarios, such as game dialogue prototyping, greater accessibility, or nonplayable character (NPC) voices. Text to speech includes [standard voice](language-support.md?tabs=tts#standard-voices) and [custom voice](language-support.md?tabs=tts#custom-voice) features. Standard voice can provide highly natural out-of-box voices with leading voice variety in terms of a large portfolio of languages and voices. Custom voice is an easy-to-use self-service for creating a highly natural custom voice.
 
 When enabling this functionality in your game, keep in mind the following benefits:
