articles/ai-services/speech-service/includes/how-to/professional-voice/train-voice/bilingual-training.md
Lines changed: 3 additions & 5 deletions
@@ -14,14 +14,12 @@ If you selected the [Neural](?tabs=neural) training type, you can train a voice
 > [!NOTE]
 > To enable a voice in the `zh-CN` locale to speak English with the same accent as the sample data, you should choose `Chinese (Mandarin, Simplified), English bilingual` when creating a project or specify the `zh-CN (English bilingual)` locale for the training set data via REST API.
 
-If you want the voice to speak English with the same accent as the sample data, then at least 10% of the training set data must be in English. Moreover, the 10% threshold is calculated based on the data accepted after successful uploading, not the data before uploading. If some uploaded English data is rejected due to defects and doesn't meet the 10% threshold, the synthesized voice will default to an English native accent.
-
 The following table shows the differences between the two locales:
 
 | Speech Studio locale | REST API locale | Bilingual support |
-|`Chinese (Mandarin, Simplified)`|`zh-CN`|If your sample data includes English, the synthesized voice will speak English with a native accent, instead of the same accent as the sample data, regardless of the amount of English data. |
-|`Chinese (Mandarin, Simplified), English bilingual`|`zh-CN (English bilingual)`|This option requires providing over 10% English data in your training set to ensure the synthesized voice can speak English with the same accent as the sample data. Otherwise, if the English data is less than 10% in your sample data, the synthesized voice will default to an English native accent. |
-|`Chinese (Taiwanese Mandarin, Traditional)`|`zh-TW`| If you want to train a synthesized voice capable of speaking English with the same accent as your sample data, make sure to provide over 10% English data in your training set. Otherwise, it will default to an English native accent. |
+|`Chinese (Mandarin, Simplified)`|`zh-CN`|If your sample data includes English, the synthesized voice will speak English with an English native accent, instead of the same accent as the sample data, regardless of the amount of English data. |
+|`Chinese (Mandarin, Simplified), English bilingual`|`zh-CN (English bilingual)`|If you want the synthesized voice to speak English with the same accent as the sample data, we recommend including over 10% English data in your training set. Otherwise, the English speaking accent may not be ideal.|
+|`Chinese (Taiwanese Mandarin, Traditional)`|`zh-TW`| If you want to train a synthesized voice capable of speaking English with the same accent as your sample data, make sure to provide over 10% English data in your training set. Otherwise, it will default to an English native accent. The 10% threshold is calculated based on the data accepted after successful uploading, not the data before uploading. If some uploaded English data is rejected due to defects and doesn't meet the 10% threshold, the synthesized voice will default to an English native accent. |
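
The `zh-TW` row says the 10% threshold is computed on the data accepted after upload, so a training set that clears the threshold before upload can fall below it once defective English utterances are rejected. The following Python sketch illustrates that arithmetic only. It assumes the share is measured per utterance (the source does not say whether it is by count or by duration) and treats "over 10%" as a strict comparison. The names `Utterance`, `english_share`, and `meets_threshold` are hypothetical and are not part of Speech Studio or the REST API.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass
class Utterance:
    """Hypothetical record for one uploaded training utterance."""
    language: str   # assumed tag, e.g. "en" or "zh"
    accepted: bool  # True if the utterance survived upload validation


def english_share(utterances: Iterable[Utterance]) -> float:
    """Fraction of ACCEPTED utterances that are English (0.0 if none accepted)."""
    accepted = [u for u in utterances if u.accepted]
    if not accepted:
        return 0.0
    english = sum(1 for u in accepted if u.language == "en")
    return english / len(accepted)


def meets_threshold(utterances: Iterable[Utterance], threshold: float = 0.10) -> bool:
    """True if accepted English data is strictly over the threshold (assumed 10%)."""
    return english_share(utterances) > threshold


# Example: 12 English + 88 Chinese utterances uploaded, 4 English ones rejected.
uploaded = (
    [Utterance("en", accepted=True)] * 8
    + [Utterance("en", accepted=False)] * 4
    + [Utterance("zh", accepted=True)] * 88
)
```

In this example the uploaded set is 12% English (12 of 100), but only 8 of the 96 accepted utterances are English, so `meets_threshold(uploaded)` is false under the stated assumptions.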