Skip to content

Commit ee5e079

Browse files
Update record-custom-voice-samples.md
1 parent 6955cde commit ee5e079

File tree

1 file changed

+9
-1
lines changed

1 file changed

+9
-1
lines changed

articles/cognitive-services/Speech-Service/record-custom-voice-samples.md

Lines changed: 9 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -20,7 +20,15 @@ Creating a high-quality production custom neural voice from scratch isn't a casu
2020

2121
Before you can make these recordings, though, you need a script: the words that will be spoken by your voice talent to create the audio samples.
2222

23-
Many small but important details go into creating a professional voice recording. This guide is a roadmap for a process that will help you get good, consistent results.
23+
Many small but important details go into creating a professional voice recording. This guide is a roadmap for a process that will help you get good, consistent results.
24+
25+
## Tips for preparing data for a high-quality voice
26+
27+
The quality of your voice model trained heavily depends on the quality of your training data. In the same training set, consistent volume, speaking rate, speaking pitch, and speaking style are essential to create a great custom neural voice. You should also avoid background noise in the recording and make sure the script and recording match. To ensure the quality of your data, you need to follow [script selection criteria](#script-selection-criteria) and [recording requirements](#recording-your-script).
28+
29+
With the premise of ensuring data quality,in most cases we recommend that you usually prepare at least 500 utterances to build a reasonable custom neural voice that sounds natural. The more training data you prepare, the higher quality of voice you can get.
30+
31+
In some cases, you may want a voice persona with unique characteristics. For example, a cartoon persona needs a voice with a special speaking style, or a voice that is very dynamic in intonation. For this voice, we recommend that you prepare at least 1000 (preferably 2000) utterances, and record them at a professional recording studio. To learn more about how to improve the quality of your voice model, see [characteristics and limitations for using Custom Neural Voice](/legal/cognitive-services/speech-service/custom-neural-voice/characteristics-and-limitations-custom-neural-voice?context=/azure/cognitive-services/speech-service/context/context).
2432

2533
## Voice recording roles
2634

0 commit comments

Comments
 (0)