articles/ai-services/speech-service/how-to-custom-voice-create-voice.md
Lines changed: 17 additions & 17 deletions
@@ -35,11 +35,11 @@ After you validate your data files, use them to build your Custom Neural Voice m
 
 The language of the training data and the target language must both be one of the [languages that are supported](language-support.md?tabs=tts) for cross lingual voice training. You don't need to prepare training data in the target language, but your test script must be in the target language.
 
-- [Neural - multi style](?tabs=multistyle#train-your-custom-neural-voice-model): Create a custom neural voice that speaks in multiple styles and emotions, without adding new training data. Multi-style voices are useful for video game characters, conversational chatbots, audiobooks, content readers, and more.
+- [Neural - multi style](?tabs=multistyle#train-your-custom-neural-voice-model): Create a custom neural voice that speaks in multiple styles and emotions, without adding new training data. Multiple style voices are useful for video game characters, conversational chatbots, audiobooks, content readers, and more.
 
-To create a multi-style voice, you need to prepare a set of general training data, at least 300 utterances. Select one or more of the preset target speaking styles. You can also create multiple custom styles by providing style samples, of at least 100 utterances per style, as extra training data for the same voice. The supported preset styles vary according to different languages. See [Available preset styles across different languages](?tabs=multistyle#available-preset-styles-across-different-languages).
+To create a multiple style voice, you need to prepare a set of general training data, at least 300 utterances. Select one or more of the preset target speaking styles. You can also create multiple custom styles by providing style samples, of at least 100 utterances per style, as extra training data for the same voice. The supported preset styles vary according to different languages. See [Available preset styles across different languages](?tabs=multistyle#available-preset-styles-across-different-languages).
 
-The language of the training data must be one of the [languages that are supported](language-support.md?tabs=tts) for custom neural voice neural, cross-lingual, or multi-style training.
+The language of the training data must be one of the [languages that are supported](language-support.md?tabs=tts) for custom neural voice, cross-lingual, or multiple style training.
 
 ## Train your Custom Neural Voice model
 
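Editorial sketch, not part of the commit: the utterance minimums quoted in this hunk (at least 300 general utterances, at least 100 per custom style) could be checked before upload with something like the following. The function name, argument names, and style names are hypothetical; only the two thresholds come from the text.

```python
# Minimums quoted in the article text; everything else here is illustrative.
MIN_GENERAL_UTTERANCES = 300
MIN_STYLE_UTTERANCES = 100


def check_multi_style_data(general_count, custom_style_counts):
    """Return a list of problems; an empty list means the counts meet the minimums.

    general_count: number of utterances in the general training data.
    custom_style_counts: dict mapping a custom style name to its utterance count.
    """
    problems = []
    if general_count < MIN_GENERAL_UTTERANCES:
        problems.append(
            f"general data: {general_count} utterances, "
            f"need at least {MIN_GENERAL_UTTERANCES}"
        )
    for style, count in custom_style_counts.items():
        if count < MIN_STYLE_UTTERANCES:
            problems.append(
                f"style '{style}': {count} utterances, "
                f"need at least {MIN_STYLE_UTTERANCES}"
            )
    return problems
```

This only counts utterances. It does not check language support or audio quality, which Speech Studio evaluates during data processing.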
@@ -48,15 +48,15 @@ To create a custom neural voice in Speech Studio, follow these steps for one of
 # [Neural](#tab/neural)
 
 1. Sign in to the [Speech Studio](https://aka.ms/speechstudio/customvoice).
-1. Select **Custom Voice** > Your project name > **Train model** > **Train a new model**.
+1. Select **Custom Voice** > *\<Your project name>* > **Train model** > **Train a new model**.
 1. Select **Neural** as the [training method](#choose-a-training-method) for your model and then select **Next**. To use a different training method, see [Neural - cross lingual](?tabs=crosslingual#train-your-custom-neural-voice-model) or [Neural - multi style](?tabs=multistyle#train-your-custom-neural-voice-model).
 
 :::image type="content" source="media/custom-voice/cnv-train-neural.png" alt-text="Screenshot that shows how to select neural training.":::
 
-1. Select a version of the training recipe for your model. The latest version is selected by default. The supported features and training time can vary by version. Normally, the latest version is recommended for the best results. In some cases, you can choose an older version to reduce training time.
+1. Select a version of the training recipe for your model. The latest version is selected by default. The supported features and training time can vary by version. Normally, we recommend the latest version. In some cases, you can choose an earlier version to reduce training time.
 1. Select the data that you want to use for training. Duplicate audio names are removed from the training. Make sure that the data you select doesn't contain the same audio names across multiple *.zip* files.
 
-Only successfully processed datasets can be selected for training. If you don't see your training set in the list, check your data processing status.
+You can select only successfully processed datasets for training. If you don't see your training set in the list, check your data processing status.
 
 1. Select a speaker file with the voice talent statement that corresponds to the speaker in your training data.
 1. Select **Next**.
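Editorial sketch, not part of the commit: the step above warns that duplicate audio names across multiple *.zip* files are removed from training. A local pre-check might look like the following. It assumes that "same audio name" means the same base file name (case-sensitive, folder path ignored), which the article does not spell out, and the function name is hypothetical.

```python
import zipfile
from collections import defaultdict
from pathlib import PurePosixPath


def find_duplicate_audio_names(zip_paths):
    """Map each file name found in more than one archive to those archives.

    Assumption: names are compared by base name only, ignoring any folder
    inside the archive. Adjust if your datasets are organized differently.
    """
    seen = defaultdict(set)
    for zip_path in zip_paths:
        with zipfile.ZipFile(zip_path) as archive:
            for member in archive.namelist():
                if member.endswith("/"):
                    continue  # skip directory entries
                name = PurePosixPath(member).name
                seen[name].add(str(zip_path))
    # Keep only the names that appear in two or more archives.
    return {name: sorted(paths) for name, paths in seen.items() if len(paths) > 1}
```

An empty result means no name appears in more than one of the archives you passed in. It says nothing about duplicates inside a single archive.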
@@ -73,15 +73,15 @@ To create a custom neural voice in Speech Studio, follow these steps for one of
 # [Neural - cross lingual](#tab/crosslingual)
 
 1. Sign in to the [Speech Studio](https://aka.ms/speechstudio/customvoice).
-1. Select **Custom Voice** > Your project name > **Train model** > **Train a new model**.
+1. Select **Custom Voice** > *\<Your project name>* > **Train model** > **Train a new model**.
 1. Select **Neural - cross lingual** as the [training method](#choose-a-training-method) for your model. To use a different training method, see [Neural](?tabs=neural#train-your-custom-neural-voice-model) or [Neural - multi style](?tabs=multistyle#train-your-custom-neural-voice-model).
 
 :::image type="content" source="media/custom-voice/cnv-train-neural-cross-lingual.png" alt-text="Screenshot that shows how to select neural cross lingual training.":::
 
-1. Select the **Target language** that is the secondary language for your voice model. Only one target language can be selected for a voice model.
+1. Select the **Target language** that is the secondary language for your voice model. You can select only one target language for a voice model.
 1. Select the data that you want to use for training. Duplicate audio names are removed from the training. Make sure that the data you select doesn't contain the same audio names across multiple *.zip* files.
 
-Only successfully processed datasets can be selected for training. Check your data processing status if you don't see your training set in the list.
+You can select only successfully processed datasets for training. Check your data processing status if you don't see your training set in the list.
 
 1. Select a speaker file with the voice talent statement that corresponds to the speaker in your training data.
 1. Select **Next**.
@@ -98,15 +98,15 @@ To create a custom neural voice in Speech Studio, follow these steps for one of
 # [Neural - multi style](#tab/multistyle)
 
 1. Sign in to the [Speech Studio](https://aka.ms/speechstudio/customvoice).
-1. Select **Custom Voice** > Your project name > **Train model** > **Train a new model**.
+1. Select **Custom Voice** > *\<Your project name>* > **Train model** > **Train a new model**.
 1. Select **Neural - multi style** as the [training method](#choose-a-training-method) for your model. To use a different training method, see [Neural](?tabs=neural#train-your-custom-neural-voice-model) or [Neural - cross lingual](?tabs=crosslingual#train-your-custom-neural-voice-model).
 
 :::image type="content" source="media/custom-voice/cnv-train-neural-multi-style.png" alt-text="Screenshot that shows how to select neural multi style training.":::
 
 1. Select one or more preset speaking styles to train.
 1. Select the data that you want to use for training. Duplicate audio names are removed from the training. Make sure that the data you select doesn't contain the same audio names across multiple *.zip* files.
 
-Only successfully processed datasets can be selected for training. Check your data processing status if you don't see your training set in the list.
+You can select only successfully processed datasets for training. Check your data processing status if you don't see your training set in the list.
 
 1. Select **Next**.
 1. Optionally, you can add other custom speaking styles. The maximum number of custom styles varies by language: `English (United States)` allows up to 10 custom styles, `Chinese (Mandarin, Simplified)` allows up to four custom styles, and `Japanese (Japan)` allows up to five custom styles.
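Editorial sketch, not part of the commit: the per-language custom style limits in the last step above can be captured in a small lookup. The three limits are the ones the article lists; the table name, the function name, and the choice to raise an error for unlisted languages are assumptions made for illustration.

```python
# Limits as listed in the article; other languages are not covered by the text.
MAX_CUSTOM_STYLES = {
    "English (United States)": 10,
    "Chinese (Mandarin, Simplified)": 4,
    "Japanese (Japan)": 5,
}


def custom_styles_allowed(language, requested):
    """Return True if `requested` custom styles fit within the listed limit."""
    limit = MAX_CUSTOM_STYLES.get(language)
    if limit is None:
        # The article gives no limit for this language, so refuse to guess.
        raise ValueError(f"no custom style limit listed for {language!r}")
    return requested <= limit
```

For example, `custom_styles_allowed("Japanese (Japan)", 5)` returns `True`, while five custom styles for `Chinese (Mandarin, Simplified)` returns `False`.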
@@ -117,7 +117,7 @@ To create a custom neural voice in Speech Studio, follow these steps for one of
 1. Select **Next**.
 1. Select a speaker file with the voice talent statement that corresponds to the speaker in your training data.
 1. Select **Next**.
-1. Each training generates 100 sample audio files for the default style and 20 for each preset style automatically to help you test the model with a default script.
+1. Each training automatically generates 100 sample audio files for the default style and 20 for each preset style to help you test the model with a default script.
 
 Optionally, you can also select **Add my own test script** and provide your own test script with up to 100 utterances to test the default style at no extra cost. The generated audio files are a combination of the automatic test scripts and custom test scripts. For more information, see [test script requirements](#test-script-requirements).
 
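Editorial sketch, not part of the commit: the automatic sample count described above is simple arithmetic, 100 files for the default style plus 20 per selected preset style. The function name and the style names in the usage line are hypothetical, and audio generated from an optional custom test script is deliberately left out because the article does not say how those files are counted.

```python
def automatic_sample_count(preset_styles):
    """Number of automatically generated sample audio files for one training.

    100 for the default style plus 20 for each selected preset style.
    Files from a custom test script are not included in this figure.
    """
    return 100 + 20 * len(preset_styles)
```

With two preset styles selected, for instance `automatic_sample_count(["cheerful", "sad"])`, the result is 140.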
@@ -166,7 +166,7 @@ While the model status is **Processing**, you can select **Cancel training** to
 
 After you finish training the model successfully, you can review the model details and [Test your voice model](#test-your-voice-model).
 
-You can use the [Audio Content Creation](how-to-audio-content-creation.md) tool in Speech Studio to create audio and fine-tune your deployed voice. If applicable for your voice, one of multiple styles can also be selected.
+You can use the [Audio Content Creation](how-to-audio-content-creation.md) tool in Speech Studio to create audio and fine-tune your deployed voice. If applicable for your voice, you can select one of multiple styles.
 
 ### Rename your model
 
@@ -180,7 +180,7 @@ You can use the [Audio Content Creation](how-to-audio-content-creation.md) tool
 
 ### Test your voice model
 
-After your voice model is successfully built, you can use the generated sample audio files to test it before deploying it for use.
+After your voice model is successfully built, you can use the generated sample audio files to test it before you deploy it.
 
 The quality of the voice depends on many factors, such as:
 
@@ -224,7 +224,7 @@ Each paragraph of the utterance results in a separate audio. If you want to comb
 
 ### Update engine version for your voice model
 
-Azure Text to speech engines are updated from time to time to capture the latest language model that defines the pronunciation of the language. After you train your voice, you can apply your voice to the new language model by updating to the latest engine version.
+Azure text to speech engines are updated from time to time to capture the latest language model that defines the pronunciation of the language. After you train your voice, you can apply your voice to the new language model by updating to the latest engine version.
 
 1. When a new engine is available, you're prompted to update your neural voice model.
 
@@ -240,7 +240,7 @@ Azure Text to speech engines are updated from time to time to capture the latest
 
 You're not charged for an engine update. The previous versions are still kept.
 
-1. You can check all engine versions for the model from **Engine version** list, or remove one if you don't need it anymore.
+1. You can check all engine versions for the model from the **Engine version** list, or remove one if you don't need it anymore.
 
 :::image type="content" source="media/custom-voice/cnv-engine-version.png" alt-text="Screenshot of displaying Engine version drop-down list.":::
 
@@ -276,7 +276,7 @@ To copy your custom neural voice model to another project:
 :::image type="content" source="media/custom-voice/cnv-model-copy-dialog.png" alt-text="Screenshot of the copy voice model dialog.":::
 
 1. Select **Submit** to copy the model.
-1. Select **View model** under the notification message for copy success.
+1. Select **View model** under the notification message for the successful copy.
 
 Navigate to the project where you copied the model to [deploy the model copy](how-to-deploy-and-use-endpoint.md).