You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
-[Custom neural voice](custom-neural-voice.md) (Custom models for Speech synthesizing)
35
-
34
+
-[Custom voice](custom-neural-voice.md) - Fine-tuning of text to speech models with custom data.
36
35
37
36
One Speech resource – Storage account combination can be used for all four scenarios simultaneously in all combinations.
38
37
@@ -436,7 +435,7 @@ For more information, see [Prevent anonymous public read access to containers an
436
435
437
436
**Configure Azure Storage firewall**
438
437
439
-
Custom neural voice uses [User delegation SAS](/azure/storage/common/storage-sas-overview#user-delegation-sas) to read the data for professional voice fine-tuning. It requires allowing external network traffic access to the Storage account.
438
+
Custom voice uses [User delegation SAS](/azure/storage/common/storage-sas-overview#user-delegation-sas) to read the data for professional voice fine-tuning. It requires allowing external network traffic access to the Storage account.
440
439
441
440
1. Go to the [Azure portal](https://portal.azure.com/) and sign in to your Azure account.
Copy file name to clipboardExpand all lines: articles/ai-services/speech-service/call-center-overview.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -47,7 +47,7 @@ You might want to further customize and fine-tune the experience for your produc
47
47
| Speech customization | Description |
48
48
| -------------- | ----------- |
49
49
|[Custom speech](./custom-speech-overview.md)| A speech to text feature used to evaluate and improve the speech recognition accuracy of use-case specific entities (such as alpha-numeric customer, case, and contract IDs, license plates, and names). You can also train a custom model with your own product names and industry terminology. |
50
-
|[Custom neural voice](./custom-neural-voice.md)| A text to speech feature that lets you create a one-of-a-kind, customized, synthetic voice for your applications. |
50
+
|[Custom voice](./custom-neural-voice.md)| A text to speech feature that lets you create a one-of-a-kind, customized, synthetic voice for your applications. |
Copy file name to clipboardExpand all lines: articles/ai-services/speech-service/custom-neural-voice.md
+4-4Lines changed: 4 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -12,7 +12,7 @@ ms.author: eur
12
12
13
13
# What is custom voice?
14
14
15
-
Custom voice is a text to speech feature that lets you create a one-of-a-kind, customized, synthetic voice for your applications. With custom voice, you can build a highly natural-sounding voice for your brand or characters by providing human speech samples as training data.
15
+
Custom voice is a text to speech feature that lets you create a one-of-a-kind, customized, synthetic voice for your applications. With custom voice, you can build a highly natural-sounding voice for your brand or characters by providing human speech samples as fine-tuning data.
16
16
17
17
> [!IMPORTANT]
18
18
> Custom voice access is [limited](/legal/cognitive-services/speech-service/custom-neural-voice/limited-access-custom-neural-voice?context=%2fazure%2fcognitive-services%2fspeech-service%2fcontext%2fcontext) based on eligibility and usage criteria. Request access on the [intake form](https://aka.ms/customneural).
@@ -35,8 +35,8 @@ Before you get started in Speech Studio, here are some considerations:
35
35
Here's an overview of the steps to create a custom voice in Speech Studio:
36
36
37
37
1.[Create a project](professional-voice-create-project.md) to contain your data, voice models, tests, and endpoints. Each project is specific to a country/region and language. If you're going to create multiple voices, it's recommended that you create a project for each voice.
38
-
1.[Set up voice talent](professional-voice-create-project.md). Before you can train a neural voice, you must submit a recording of the voice talent's consent statement. The voice talent statement is a recording of the voice talent reading a statement that they consent to the usage of their speech data to train a custom voice model.
39
-
1.[Prepare training data](professional-voice-create-training-set.md) in the right [format](how-to-custom-voice-training-data.md). It's a good idea to capture the audio recordings in a professional quality recording studio to achieve a high signal-to-noise ratio. The quality of the voice model depends heavily on your training data. Consistent volume, speaking rate, pitch, and consistency in expressive mannerisms of speech are required.
38
+
1.[Set up voice talent](professional-voice-create-project.md). Before you can fine-tune a professional voice, you must submit a recording of the voice talent's consent statement. The voice talent statement is a recording of the voice talent reading a statement that they consent to the usage of their speech data for professional voice fine-tuning.
39
+
1.[Prepare fine-tuning data](professional-voice-create-training-set.md) in the right [format](how-to-custom-voice-training-data.md). It's a good idea to capture the audio recordings in a professional quality recording studio to achieve a high signal-to-noise ratio. The quality of the voice model depends heavily on your fine-tuning data. Consistent volume, speaking rate, pitch, and consistency in expressive mannerisms of speech are required.
40
40
1.[Train your voice model](professional-voice-train-voice.md). Select at least 300 utterances to create a custom voice. A series of data quality checks are automatically performed when you upload them. To build high-quality voice models, you should fix any errors and submit again.
41
41
1.[Test your voice](professional-voice-train-voice.md#test-your-voice-model). Prepare test scripts for your voice model that cover the different use cases for your apps. It’s a good idea to use scripts within and outside the training dataset, so you can test the quality more broadly for different content.
42
42
1.[Deploy and use your voice model](professional-voice-deploy-endpoint.md) in your apps.
@@ -77,5 +77,5 @@ An AI system includes not only the technology, but also the people who use it, t
77
77
## Next steps
78
78
79
79
*[Create a project](professional-voice-create-project.md)
80
-
*[Prepare training data](professional-voice-create-training-set.md)
Copy file name to clipboardExpand all lines: articles/ai-services/speech-service/gaming-concepts.md
+2-2Lines changed: 2 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -20,7 +20,7 @@ Here are a few Speech features to consider for flexible and interactive game exp
20
20
- Make the game more accessible for players who are unable to read text in a particular language, including young players who don't read or write. Players can listen to storylines and instructions in their preferred language.
21
21
- Create game avatars and nonplayable characters (NPC) that can initiate or participate in a conversation in-game.
22
22
- Standard voice can provide highly natural out-of-box voices with leading voice variety in terms of a large portfolio of languages and voices.
23
-
- Custom neural voice for creating a voice that stays on-brand with consistent quality and speaking style. You can add emotions, accents, nuances, laughter, and other para linguistic sounds and expressions.
23
+
- Custom voice for creating a voice that stays on-brand with consistent quality and speaking style. You can add emotions, accents, nuances, laughter, and other para linguistic sounds and expressions.
24
24
- Use game dialogue prototyping to shorten the amount of time and money spent in product to get the game to market sooner. You can rapidly swap lines of dialog and listen to variations in real-time to iterate the game content.
25
25
26
26
You can use the [Speech SDK](speech-sdk.md) or [Speech CLI](spx-overview.md) for real-time low latency speech to text, text to speech, language identification, and speech translation. You can also use the [Batch transcription API](batch-transcription.md) to transcribe prerecorded speech to text. To synthesize a large volume of text input (long and short) to speech, use the [Batch synthesis API](batch-synthesis.md).
@@ -29,7 +29,7 @@ For information about locale and regional availability, see [Language and voice
29
29
30
30
## Text to speech
31
31
32
-
Help bring everyone into the conversation by converting text messages to audio using [Text to speech](text-to-speech.md) for scenarios, such as game dialogue prototyping, greater accessibility, or nonplayable character (NPC) voices. Text to speech includes [standard voice](language-support.md?tabs=tts#standard-voices) and [custom voice](language-support.md?tabs=tts#custom-neural-voice) features. Standard voice can provide highly natural out-of-box voices with leading voice variety in terms of a large portfolio of languages and voices. Custom neural voice is an easy-to-use self-service for creating a highly natural custom voice.
32
+
Help bring everyone into the conversation by converting text messages to audio using [Text to speech](text-to-speech.md) for scenarios, such as game dialogue prototyping, greater accessibility, or nonplayable character (NPC) voices. Text to speech includes [standard voice](language-support.md?tabs=tts#standard-voices) and [custom voice](language-support.md?tabs=tts#custom-voice) features. Standard voice can provide highly natural out-of-box voices with leading voice variety in terms of a large portfolio of languages and voices. Custom voice is an easy-to-use self-service for creating a highly natural custom voice.
33
33
34
34
When enabling this functionality in your game, keep in mind the following benefits:
0 commit comments