Commit a9e490c

Merge pull request #12 from eric-urban/eur/patch-8
proposed edits
2 parents e2e131f + 4fcd2c4 commit a9e490c

1 file changed (+12, -7 lines changed)

articles/cognitive-services/Speech-Service/record-custom-voice-samples.md

Lines changed: 12 additions & 7 deletions
@@ -107,16 +107,21 @@ Below are some general guidelines that you can follow to create a good corpus (r
 
 - Check the script carefully for errors. If possible, have someone else check it too. When you run through the script with your talent, you'll probably catch a few more mistakes.
 
-### Difference between script for voice talent and script for training
+### Difference between voice talent script and training script
 
-The sample scripts we provided on [GitHub](https://github.com/Azure-Samples/Cognitive-Speech-TTS/tree/master/CustomVoice/script) are just defined for voice talent. If you use the sample scripts to upload for training, you must normalize them in their spoken form. The scripts prepared for voice talent need to follow the native reading conventions, such as 50% and $45, while the scripts used for training need to be normalized to make sure that the scripts match the audio content, such as *fifty percent* and *forty-five dollars*. Make sure the scripts used for training match the recordings of your voice talent, especially scripts contaning digits, symbols, abbreviation, date, and time. We provide a few examples of text normalization rules and explain the difference between script for voice talent and script for training.
+The training script can differ from the voice talent script, especially for scripts that contain digits, symbols, abbreviations, date, and time. Scripts prepared for the voice talent must follow the native reading conventions, such as 50% and $45. The scripts used for training must be normalized to match the audio recording, such as *fifty percent* and *forty-five dollars*.
 
-| Category |Script for voice talent<br> (non-normalized) | Script for training <br> (normalized) |
+> [!NOTE]
+> We provide some example scripts for the voice talent on [GitHub](https://github.com/Azure-Samples/Cognitive-Speech-TTS/tree/master/CustomVoice/script). To use the example scripts for training, you must normalize them according to the recordings of your voice talent before uploading the file.
+
+The following table shows the difference between scripts for voice talent and the normalized script for training.
+
+| Category |Voice talent script example | Training script example (normalized) |
 | --------- | --------- | --------------------------- |
-| Digits |For example, '123'.| Normalize '123' according to the recordings, such as 'one hundred and twenty-three' . |
-| Symbols | For example, '50%'.| Normalize '50%' according to the recordings, such as 'fifty percent'. |
-| Abbreviation | For example, 'ASAP'.| Normalize 'ASAP' according to the recordings, such as 'as soon as possible'. |
-| Date and time | For example, 'in 2008'. | Normalize '2008' according to the recordings, such as 'two thousand and eight'. |
+| Digits |123| one hundred and twenty-three |
+| Symbols |50%| fifty percent|
+| Abbreviation |ASAP| as soon as possible|
+| Date and time |March 3rd at 5:00 PM| March third at five PM|
 
 ### Typical defects of a script
 
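As an illustration of the normalization this change describes, here is a minimal Python sketch that turns a voice talent line into a training line. It is not part of the commit or of any Speech service tooling: the `SPOKEN_FORMS` table and the `normalize_for_training` function are hypothetical names, and the entries are copied from the examples in the diff. In practice the spoken forms have to come from what the voice talent actually said in the recording, so a fixed lookup table like this is only a starting point.

```python
import re

# Hypothetical lookup table seeded with the examples from the diff.
# Real spoken forms must be taken from the actual recordings.
SPOKEN_FORMS = {
    "123": "one hundred and twenty-three",
    "50%": "fifty percent",
    "$45": "forty-five dollars",
    "ASAP": "as soon as possible",
    "March 3rd at 5:00 PM": "March third at five PM",
}

# Try longer written forms first so multi-word entries win over shorter ones.
_KEYS = sorted(SPOKEN_FORMS, key=len, reverse=True)

# The lookarounds keep "123" from matching inside a longer token like "1234".
_PATTERN = re.compile(
    r"(?<!\w)(?:" + "|".join(re.escape(k) for k in _KEYS) + r")(?!\w)"
)


def normalize_for_training(line: str) -> str:
    """Replace each known written form in `line` with its spoken form."""
    return _PATTERN.sub(lambda m: SPOKEN_FORMS[m.group(0)], line)


if __name__ == "__main__":
    print(normalize_for_training("Save 50% and pay $45 ASAP."))
```

Anything the table does not cover passes through unchanged, so the output still needs a manual check against the audio before upload.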