Skip to content

Commit 30b0a74

Browse files
Update bilingual-training.md
1 parent 5a72ab6 commit 30b0a74

File tree

1 file changed

+6
-6
lines changed

1 file changed

+6
-6
lines changed

articles/ai-services/speech-service/includes/how-to/professional-voice/train-voice/bilingual-training.md

Lines changed: 6 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -9,19 +9,19 @@ ms.date: 2/18/2024
99
ms.custom: include
1010
---
1111

12-
If you selected the [Neural](?tabs=neural) training type, you can train a voice to speak in multiple languages. The `zh-CN` and `zh-TW` locales both support bilingual training for the voice to speak both Chinese and English. Depending in part on your training data, the synthesized voice can speak English with an English accent or English with the same native accent as the training data.
12+
If you selected the [Neural](?tabs=neural) training type, you can train a voice to speak in multiple languages. The `zh-CN` and `zh-TW` locales both support bilingual training for the voice to speak both Chinese and English. Depending in part on your training data, the synthesized voice can speak English with an English native accent or English with the same accent as the training data.
1313

1414
> [!NOTE]
15-
> However, for a voice in the `zh-CN` locale to speak English with the native accent, you must select `Chinese (Mandarin, Simplified), English bilingual` (or `zh-CN (English bilingual)` via REST API) when creating a project.
15+
> To enable a voice in the zh-CN locale to speak English with the same accent as the sample data, you should choose `Chinese (Mandarin, Simplified), English bilingual` when creating a project or specify the `zh-CN (English bilingual)` locale for the training set data via REST API.
1616
17-
If you want the voice to speak English with native accent, then at least 10% of the training dataset must be in English. Moreover, the 10% threshold is calculated based on the data accepted after successful uploading, not the data before uploading. If some uploaded English data is rejected due to defects and doesn't meet the 10% threshold, the synthesized voice will default to an English native accent.
17+
If you want the voice to speak English with the same accent as the sample data, then at least 10% of the training set data must be in English. Moreover, the 10% threshold is calculated based on the data accepted after successful uploading, not the data before uploading. If some uploaded English data is rejected due to defects and doesn't meet the 10% threshold, the synthesized voice will default to an English native accent.
1818

1919
The following table shows the differences between the two locales:
2020

2121
| Speech Studio locale | REST API locale | Bilingual support |
2222
|:------------- |:------- |:-------------------------- |
23-
| `Chinese (Mandarin, Simplified)` | `zh-CN` | English with English accent is the default.<br/><br/>English with native accent isn't available, regardless of your training data. |
24-
| `Chinese (Mandarin, Simplified), English bilingual` | `zh-CN (English bilingual)` | English with English accent is the default.<br/><br/>If you want the voice to speak English with native accent, then at least 10% of the training dataset must be in English. |
25-
| `Chinese (Taiwanese Mandarin, Traditional)` | `zh-TW` | English with English accent is the default.<br/><br/>If you want the voice to speak English with native accent, then at least 10% of the training dataset must be in English. |
23+
| `Chinese (Mandarin, Simplified)` | `zh-CN` |If your sample data includes English, the synthesized voice will speak English with a native accent, instead of the same accent as the sample data, regardless of the amount of English data. |
24+
| `Chinese (Mandarin, Simplified), English bilingual` | `zh-CN (English bilingual)` |This option requires providing over 10% English data in your training set to ensure the synthesized voice can speak English with the same accent as the sample data. Otherwise, if the English data is less than 10% in your sample data, the synthesized voice will default to an English native accent. |
25+
| `Chinese (Taiwanese Mandarin, Traditional)` | `zh-TW` | If you want to train a synthesized voice capable of speaking English with the same accent as your sample data, make sure to provide over 10% English data in your training set. Otherwise, it will default to an English native accent. |
2626

2727

0 commit comments

Comments
 (0)