Commit cffdb70

Update record-custom-voice-samples.md
1 parent 267c707 commit cffdb70

1 file changed

+6
-5
lines changed

articles/ai-services/speech-service/record-custom-voice-samples.md

Lines changed: 6 additions & 5 deletions
@@ -13,7 +13,7 @@ ms.author: eur
 
 # Recording voice samples for custom neural voice
 
-This article provides you with instructions on preparing high-quality voice samples for creating a professional voice model using the custom neural voice Pro project.
+This article provides you with instructions on preparing high-quality voice samples for creating a professional voice model using the custom neural voice Pro project. To understand how the data is processed and the minimum requirements for data acceptance, please refer to [upload your data](professional-voice-create-training-set.md#upload-your-data).
 
 Creating a high-quality production custom neural voice from scratch isn't a casual undertaking. The central component of a custom neural voice is a large collection of audio samples of human speech. It's vital that these audio recordings be of high quality. Choose a voice talent who has experience making these kinds of recordings, and have them recorded by a recording engineer using professional equipment.
 
@@ -74,9 +74,10 @@ We provide [sample scripts in the 'General', 'Chat' and 'Customer Service' domai
 
 Below are some general guidelines that you can follow to create a good corpus (recorded audio samples) for custom neural voice training.
 
-- Balance your script to cover different sentence types in your domain including statements, questions, exclamations, long sentences, and short sentences.
-
-  Each sentence is recommended to be between 2 and 15 seconds long, with 5 to 30 words for Latin-based languages or 4 to 80 words for non-Latin languages. Ensure your script does not include any duplicate sentences.<br>
+- For most use cases, sentences are recommended to be between 2 and 15 seconds long, containing 5 to 30 words for Latin-based languages or 4 to 80 words for non-Latin languages. Aim to balance your script to include a variety of sentence types and lengths. Ensure your script does not include any duplicate sentences.<br>
+
+  If your use case requires a high emphasis on questions, exclamations, or a mix of particularly long and short sentences, it is recommended to include a good portion of sentences as questions or exclamations, along with very short phrases and longer phrases up to 20 seconds in length.
+
   For how to balance the different sentence types, refer to the following table:
 
 | Sentence types | Coverage |
@@ -89,7 +90,7 @@ Below are some general guidelines that you can follow to create a good corpus (r
 > [!NOTE]
 > Short words/phrases should be separated with a commas. They help remind your voice talent to pause briefly when reading them.
 >
-> You can estimate the number of words in a sentence by assuming a speech rate in words per second based on your language. And the range can be extended to 1 to 100 words to better accommodate short or long sentence scenarios.
+> You can estimate the number of words in a sentence by assuming a speech rate in words per second based on your language.
 >
 > Question and exclamations are required if you want the generated voice to accurately convey questions or exclamations.
 

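The length guidance touched by this commit (2 to 15 seconds, 5 to 30 words for Latin-based languages, no duplicate sentences) can be checked against a draft script before recording. The following is a minimal sketch, not part of the commit or of any Speech service tooling: the function name `check_script`, the whitespace-based word count, and the speech rate of 2.5 words per second are all assumptions chosen for illustration.

```python
import re
from collections import Counter

# Limits quoted in the changed guideline text.
LATIN_WORD_RANGE = (5, 30)
DURATION_RANGE_S = (2.0, 15.0)
# Assumed speech rate; the article only says to pick one for your language.
ASSUMED_WORDS_PER_SECOND = 2.5


def check_script(sentences, word_range=LATIN_WORD_RANGE,
                 words_per_second=ASSUMED_WORDS_PER_SECOND):
    """Return (sentence_number, problem) pairs for a list of script sentences.

    Problems are "word_count", "duration" (estimated from the assumed
    speech rate), and "duplicate" (same text ignoring case and spacing).
    """
    normalized = [re.sub(r"\s+", " ", s.strip().lower()) for s in sentences]
    counts = Counter(normalized)
    problems = []
    for number, (raw, norm) in enumerate(zip(sentences, normalized), start=1):
        words = len(raw.split())  # whitespace split; unsuitable for unsegmented scripts
        est_seconds = words / words_per_second
        if not word_range[0] <= words <= word_range[1]:
            problems.append((number, "word_count"))
        if not DURATION_RANGE_S[0] <= est_seconds <= DURATION_RANGE_S[1]:
            problems.append((number, "duration"))
        if counts[norm] > 1:
            problems.append((number, "duplicate"))
    return problems


if __name__ == "__main__":
    draft = [
        "Hello there.",
        "Please confirm your appointment for next Tuesday morning.",
        "Please confirm your appointment for next Tuesday morning.",
    ]
    for number, problem in check_script(draft):
        print(number, problem)
```

For a non-Latin language, pass `word_range=(4, 80)` and a speech rate appropriate to that language; the whitespace split would also need replacing with a tokenizer suited to the script.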