Skip to content

Commit a644149

Browse files
authored
Merge pull request #99299 from ArcherAZ/patch-28
Update how-to-custom-speech-test-data.md
2 parents 6e2b632 + e931cb7 commit a644149

File tree

1 file changed

+4
-4
lines changed

1 file changed

+4
-4
lines changed

articles/cognitive-services/Speech-Service/how-to-custom-speech-test-data.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -112,14 +112,14 @@ Product names or features that are unique, should include related text data for
112112

113113
| Data type | How this data improves recognition |
114114
|-----------|------------------------------------|
115-
| Utterances (sentences) | Improve accuracy when recognizing product names, or industry-specific vocabulary within the context of a sentence. |
115+
| Sentences (utterances) | Improve accuracy when recognizing product names, or industry-specific vocabulary within the context of a sentence. |
116116
| Pronunciations | Improve pronunciation of uncommon terms, acronyms, or other words with undefined pronunciations. |
117117

118-
Utterances can be provided as a single text file or multiple text files. To improve accuracy, use text data that is closer to the expected spoken utterances. Pronunciations should be provided as a single text file. Everything can be packaged as a single zip file and uploaded to the <a href="https://speech.microsoft.com/customspeech" target="_blank">Custom Speech portal <span class="docon docon-navigate-external x-hidden-focus"></span></a>.
118+
Sentences can be provided as a single text file or multiple text files. To improve accuracy, use text data that is closer to the expected spoken utterances. Pronunciations should be provided as a single text file. Everything can be packaged as a single zip file and uploaded to the <a href="https://speech.microsoft.com/customspeech" target="_blank">Custom Speech portal <span class="docon docon-navigate-external x-hidden-focus"></span></a>.
119119

120-
### Guidelines to create an utterances file
120+
### Guidelines to create a sentences file
121121

122-
To create a custom model using related text, you'll need to provide a list of sample utterances. Utterances _do not_ need to be complete sentences or grammatically correct, but they must accurately reflect the spoken input you expect in production. If you want certain terms to have increased weight, add several sentences that include these specific terms.
122+
To create a custom model using sentences, you'll need to provide a list of sample utterances. Utterances _do not_ need to be complete or grammatically correct, but they must accurately reflect the spoken input you expect in production. If you want certain terms to have increased weight, add several sentences that include these specific terms.
123123

124124
As general guidance, model adaptation is most effective when the training text is as close as possible to the real text expected in production. Domain-specific jargon and phrases that you're targeting to enhance, should be included in training text. When possible, try to have one sentence or keyword controlled on a separate line. For keywords and phrases that are important to you (for example, product names), you can copy them a few times. But keep in mind, don't copy too much - it could affect the overall recognition rate.
125125

0 commit comments

Comments
 (0)