articles/ai-services/speech-service/custom-neural-voice-lite.md
+1 -1 (lines changed: 1 addition & 1 deletion)
@@ -43,7 +43,7 @@ To create a custom voice lite project, follow these steps:
  1. Sign in to the [Speech Studio](https://aka.ms/speechstudio/customvoice).
  1. Select the subscription and Speech resource to work with.
  1. Select **Custom voice** > **Create a project**.
- 1. Select **Custom voice lite** > **Next**. To create a custom voice professional project instead, see [Create a project for custom voice](professional-voice-create-project.md).
+ 1. Select **Custom neural voice lite** > **Next**. To create a custom voice professional project instead, see the [professional voice fine-tuning documentation](professional-voice-create-project.md).

  1. Follow the instructions provided by the wizard to create your project.
articles/ai-services/speech-service/faq-tts.yml
+1 -1 (lines changed: 1 addition & 1 deletion)
@@ -78,7 +78,7 @@ sections:
  - question: |
  How does cross-lingual voice work with languages that have different pronunciation structure and assembly?
  answer: |
- Sentence structure and pronunciation naturally vary across languages such as English and Japanese. Each neural voice is trained with audio data recorded by native speaking voice talent. For [cross lingual](professional-voice-train-voice.md?tabs=crosslingual#train-your-custom-neural-voice-model) voice, we transfer the major features like timbre to sound like the original speaker and preserve the right pronunciation. For example, a cross-lingual voice uses the native way to speak Japanese and still sounds similar (but not exactly) like the original English speaker.
+ Sentence structure and pronunciation naturally vary across languages such as English and Japanese. Each neural voice is trained with audio data recorded by native speaking voice talent. For [cross lingual](professional-voice-train-voice.md?tabs=crosslingual#train-your-custom-voice-model) voice, we transfer the major features like timbre to sound like the original speaker and preserve the right pronunciation. For example, a cross-lingual voice uses the native way to speak Japanese and still sounds similar (but not exactly) like the original English speaker.
  - question: |
  Can I use professional voice fine-tuning to customize pronunciation for my domain?
articles/ai-services/speech-service/gaming-concepts.md
+1 -1 (lines changed: 1 addition & 1 deletion)
@@ -29,7 +29,7 @@ For information about locale and regional availability, see [Language and voice

  ## Text to speech

- Help bring everyone into the conversation by converting text messages to audio using [Text to speech](text-to-speech.md) for scenarios, such as game dialogue prototyping, greater accessibility, or nonplayable character (NPC) voices. Text to speech includes [standard voice](language-support.md?tabs=tts#standard-voices) and [custom voice](language-support.md?tabs=tts#custom-voice) features. Standard voice can provide highly natural out-of-box voices with leading voice variety in terms of a large portfolio of languages and voices. Custom voice is an easy-to-use self-service for creating a highly natural custom voice.
+ Help bring everyone into the conversation by converting text messages to audio using [Text to speech](text-to-speech.md) for scenarios, such as game dialogue prototyping, greater accessibility, or nonplayable character (NPC) voices. Text to speech includes [standard voice](language-support.md?tabs=tts#standard-voices) and [custom voice](language-support.md?tabs=tts#professional-voice) features. Standard voice can provide highly natural out-of-box voices with leading voice variety in terms of a large portfolio of languages and voices. Custom voice is an easy-to-use self-service for creating a highly natural custom voice.

  When enabling this functionality in your game, keep in mind the following benefits:
articles/ai-services/speech-service/includes/how-to/professional-voice/train-voice/ai-foundry.md
+10 -10 (lines changed: 10 additions & 10 deletions)
@@ -23,17 +23,17 @@ Training duration varies depending on how much data you use. It takes about 40 c

  After you validate your data files, use them to build your custom voice model. When you create a custom voice, you can choose to train it with one of the following methods:

- -[Neural](?tabs=neural#train-your-custom-neural-voice-model): Create a voice in the same language of your training data.
+ -[Neural](?tabs=neural#train-your-custom-voice-model): Create a voice in the same language of your training data.

- -[Neural - cross lingual](?tabs=crosslingual#train-your-custom-neural-voice-model): Create a voice that speaks a different language from your training data. For example, with the `zh-CN` training data, you can create a voice that speaks `en-US`.
+ -[Neural - cross lingual](?tabs=crosslingual#train-your-custom-voice-model): Create a voice that speaks a different language from your training data. For example, with the `zh-CN` training data, you can create a voice that speaks `en-US`.

  The language of the training data and the target language must both be one of the [languages that are supported](../../../../language-support.md?tabs=tts#custom-voice) for cross lingual voice training. You don't need to prepare training data in the target language, but your test script must be in the target language.

- -[Neural - multi style](?tabs=multistyle#train-your-custom-neural-voice-model): Create a custom voice that speaks in multiple styles and emotions, without adding new training data. Multiple style voices are useful for video game characters, conversational chatbots, audiobooks, content readers, and more.
+ -[Neural - multi style](?tabs=multistyle#train-your-custom-voice-model): Create a custom voice that speaks in multiple styles and emotions, without adding new training data. Multiple style voices are useful for video game characters, conversational chatbots, audiobooks, content readers, and more.

  To create a multiple style voice, you need to prepare a set of general training data, at least 300 utterances. Select one or more of the preset target speaking styles. You can also create multiple custom styles by providing style samples, of at least 100 utterances per style, as extra training data for the same voice. The supported preset styles vary according to different languages. See [available preset styles across different languages](?tabs=multistyle#available-preset-styles-across-different-languages).

- -[Neural - multi lingual (preview)](?tabs=multilingual#train-your-custom-neural-voice-model): Create a voice that speaks multiple languages using the single-language training data. For example, with the `en-US` primary training data, you can create a voice that speaks `en-US`, `de-DE`, `zh-CN` etc. secondary languages.
+ -[Neural - multi lingual (preview)](?tabs=multilingual#train-your-custom-voice-model): Create a voice that speaks multiple languages using the single-language training data. For example, with the `en-US` primary training data, you can create a voice that speaks `en-US`, `de-DE`, `zh-CN` etc. secondary languages.

  The primary language of the training data and the secondary languages must be in the [languages that are supported](../../../../language-support.md?tabs=tts#custom-voice) for multi lingual voice training. You don't need to prepare training data in the secondary languages.

@@ -49,7 +49,7 @@ To create a custom voice in Azure AI Foundry portal, follow these steps for one
  1. Select **Fine-tuning** from the left pane and then select **AI Service fine-tuning**.
  1. Select the professional voice fine-tuning task (by model name) that you [started as described in the create professional voice article](/azure/ai-services/speech-service/professional-voice-create-project).
  1. Select **Train model** > **+ Train model**.
- 1. Select **Neural** as the [training method](#choose-a-training-method) for your model. To use a different training method, see [Neural - cross lingual](?tabs=crosslingual#train-your-custom-neural-voice-model), [Neural - multi style](?tabs=multistyle#train-your-custom-neural-voice-model), [Neural - multi lingual (preview)](?tabs=multilingual#train-your-custom-neural-voice-model), or [Neural - HD Voice (preview)](?tabs=hdvoice#train-your-custom-neural-voice-model).
+ 1. Select **Neural** as the [training method](#choose-a-training-method) for your model. To use a different training method, see [Neural - cross lingual](?tabs=crosslingual#train-your-custom-voice-model), [Neural - multi style](?tabs=multistyle#train-your-custom-voice-model), [Neural - multi lingual (preview)](?tabs=multilingual#train-your-custom-voice-model), or [Neural - HD Voice (preview)](?tabs=hdvoice#train-your-custom-voice-model).

  :::image type="content" source="../../../../media/custom-voice/professional-voice/cnv-train-neural.png" alt-text="Screenshot that shows how to select neural training." lightbox="../../../../media/custom-voice/professional-voice/cnv-train-neural.png":::

@@ -82,7 +82,7 @@ To create a custom voice in Azure AI Foundry portal, follow these steps for one
  1. Select **Fine-tuning** from the left pane and then select **AI Service fine-tuning**.
  1. Select the professional voice fine-tuning task (by model name) that you [started as described in the create professional voice article](/azure/ai-services/speech-service/professional-voice-create-project).
  1. Select **Train model** > **+ Train model**.
- 1. Select **Neural - Cross lingual** as the [training method](#choose-a-training-method) for your model. To use a different training method, see [Neural](?tabs=neural#train-your-custom-neural-voice-model), [Neural - multi style](?tabs=multistyle#train-your-custom-neural-voice-model), [Neural - multi lingual (preview)](?tabs=multilingual#train-your-custom-neural-voice-model), or [Neural - HD Voice (preview)](?tabs=hdvoice#train-your-custom-neural-voice-model).
+ 1. Select **Neural - Cross lingual** as the [training method](#choose-a-training-method) for your model. To use a different training method, see [Neural](?tabs=neural#train-your-custom-voice-model), [Neural - multi style](?tabs=multistyle#train-your-custom-voice-model), [Neural - multi lingual (preview)](?tabs=multilingual#train-your-custom-voice-model), or [Neural - HD Voice (preview)](?tabs=hdvoice#train-your-custom-voice-model).

  :::image type="content" source="../../../../media/custom-voice/professional-voice/cnv-train-neural-cross-lingual.png" alt-text="Screenshot that shows how to select neural cross lingual training." lightbox="../../../../media/custom-voice/professional-voice/cnv-train-neural-cross-lingual.png":::

@@ -111,7 +111,7 @@ To create a custom voice in Azure AI Foundry portal, follow these steps for one
  1. Select **Fine-tuning** from the left pane and then select **AI Service fine-tuning**.
  1. Select the professional voice fine-tuning task (by model name) that you [started as described in the create professional voice article](/azure/ai-services/speech-service/professional-voice-create-project).
  1. Select **Train model** > **+ Train model**.
- 1. Select **Neural - multi style** as the [training method](#choose-a-training-method) for your model. To use a different training method, see [Neural](?tabs=neural#train-your-custom-neural-voice-model), [Neural - cross lingual](?tabs=crosslingual#train-your-custom-neural-voice-model), [Neural - multi lingual (preview)](?tabs=multilingual#train-your-custom-neural-voice-model), or [Neural - HD Voice (preview)](?tabs=hdvoice#train-your-custom-neural-voice-model).
+ 1. Select **Neural - multi style** as the [training method](#choose-a-training-method) for your model. To use a different training method, see [Neural](?tabs=neural#train-your-custom-voice-model), [Neural - cross lingual](?tabs=crosslingual#train-your-custom-voice-model), [Neural - multi lingual (preview)](?tabs=multilingual#train-your-custom-voice-model), or [Neural - HD Voice (preview)](?tabs=hdvoice#train-your-custom-voice-model).

  :::image type="content" source="../../../../media/custom-voice/professional-voice/cnv-train-neural-multi-style.png" alt-text="Screenshot that shows how to select neural multi style training." lightbox="../../../../media/custom-voice/professional-voice/cnv-train-neural-multi-style.png":::

@@ -154,7 +154,7 @@ The following table summarizes the different preset styles according to differen
  1. Select **Fine-tuning** from the left pane and then select **AI Service fine-tuning**.
  1. Select the professional voice fine-tuning task (by model name) that you [started as described in the create professional voice article](/azure/ai-services/speech-service/professional-voice-create-project).
  1. Select **Train model** > **+ Train model**.
- 1. Select **Neural - multi lingual** as the [training method](#choose-a-training-method) for your model. To use a different training method, see [Neural](?tabs=neural#train-your-custom-neural-voice-model), [Neural - cross lingual](?tabs=crosslingual#train-your-custom-neural-voice-model), [Neural - multi style](?tabs=multistyle#train-your-custom-neural-voice-model), or [Neural - HD Voice (preview)](?tabs=hdvoice#train-your-custom-neural-voice-model).
+ 1. Select **Neural - multi lingual** as the [training method](#choose-a-training-method) for your model. To use a different training method, see [Neural](?tabs=neural#train-your-custom-voice-model), [Neural - cross lingual](?tabs=crosslingual#train-your-custom-voice-model), [Neural - multi style](?tabs=multistyle#train-your-custom-voice-model), or [Neural - HD Voice (preview)](?tabs=hdvoice#train-your-custom-voice-model).

  :::image type="content" source="../../../../media/custom-voice/professional-voice/cnv-train-neural-multi-lingual.png" alt-text="Screenshot that shows how to select neural multi lingual training." lightbox="../../../../media/custom-voice/professional-voice/cnv-train-neural-multi-lingual.png":::

@@ -178,7 +178,7 @@ The following table summarizes the different preset styles according to differen

  # [Neural - HD Voice (preview)](#tab/hdvoice)

- HD voice training isn't yet supported in the Azure AI Foundry portal. To train an HD voice, you must use the [Speech Studio](https://speech.microsoft.com/portal). For more information, see [the corresponding Speech Studio documentation](?tabs=hdvoice&pivots=speech-studio#train-your-custom-neural-voice-model).
+ HD voice training isn't yet supported in the Azure AI Foundry portal. To train an HD voice, you must use the [Speech Studio](https://speech.microsoft.com/portal). For more information, see [the corresponding Speech Studio documentation](?tabs=hdvoice&pivots=speech-studio#train-your-custom-voice-model).

  ---

@@ -213,7 +213,7 @@ You have to clone your model to rename it. You can't rename the model directly.
  After your voice model is successfully built, you can use the generated sample audio files to test it before you deploy it.

  > [!NOTE]
- > [Neural - multi lingual (preview)](?tabs=multilingual#train-your-custom-neural-voice-model) and [Neural - HD Voice (preview)](?tabs=hdvoice#train-your-custom-neural-voice-model) do not support this type of testing.
+ > [Neural - multi lingual (preview)](?tabs=multilingual#train-your-custom-voice-model) and [Neural - HD Voice (preview)](?tabs=hdvoice#train-your-custom-voice-model) do not support this type of testing.

  The quality of the voice depends on many factors, such as:
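Most hunks in this diff make the same mechanical change: link targets ending in `#train-your-custom-neural-voice-model` now end in `#train-your-custom-voice-model`. A change like this can be applied or re-checked with a small script. The following is only a sketch of that idea, not the tooling used for this commit: the two anchor strings come from the diff, while the helper name `rewrite_anchors` and the rule of matching only anchors that close a Markdown link target are assumptions made for illustration.

```python
import re
from typing import Tuple

# Anchor names copied from the diff; everything else here is hypothetical.
OLD_ANCHOR = "#train-your-custom-neural-voice-model"
NEW_ANCHOR = "#train-your-custom-voice-model"

# Match the old anchor only where it ends a Markdown link target,
# that is, immediately before the closing parenthesis.
_ANCHOR_IN_LINK = re.compile(re.escape(OLD_ANCHOR) + r"(?=\))")


def rewrite_anchors(text: str) -> Tuple[str, int]:
    """Return the rewritten text and the number of anchors replaced."""
    return _ANCHOR_IN_LINK.subn(NEW_ANCHOR, text)


# Usage: rewrite one line taken from the diff.
line = "[Neural](?tabs=neural#train-your-custom-neural-voice-model): Create a voice"
new_line, count = rewrite_anchors(line)

# Plain prose that mentions the phrase without a link is left alone.
heading = "## Train your custom neural voice model"
same_heading, heading_count = rewrite_anchors(heading)
```

Returning the replacement count alongside the text makes it easy to report how many links changed per file, which can then be compared against the per-file totals shown in the diff headers.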