Skip to content

Commit 3373a70

Browse files
Acro score
1 parent 0988c37 commit 3373a70

File tree

1 file changed

+3
-3
lines changed

1 file changed

+3
-3
lines changed

articles/azure-video-indexer/customize-speech-model-overview.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -13,7 +13,7 @@ author: IngridAtMicrosoft
1313

1414
Through Azure AI Video Indexer integration with [Azure AI Speech services](../ai-services/speech-service/captioning-concepts.md), a Universal Language Model is utilized as a base model that is trained with Microsoft-owned data and reflects commonly used spoken language. The base model is pretrained with dialects and phonetics representing various common domains. The base model works well in most speech recognition scenarios.
1515

16-
However, sometimes the base model’s transcription doesn't accurately handle some content. In these situations, a customized speech model can be used to improve recognition of domain-specific vocabulary or pronunciation that is specific to your content by providing text data to train the model. Through the process of creating and adapting speech customization models, your content can be properly transcribed. There is no additional charge for using Video Indexers speech customization.
16+
However, sometimes the base model’s transcription doesn't accurately handle some content. In these situations, a customized speech model can be used to improve recognition of domain-specific vocabulary or pronunciation that is specific to your content by providing text data to train the model. Through the process of creating and adapting speech customization models, your content can be properly transcribed. There's no additional charge for using Video Indexers speech customization.
1717

1818
## When to use a customized speech model?
1919

@@ -53,8 +53,8 @@ A dataset including plain text sentences of related text can be used to improve
5353
- Try to have each sentence or keyword on a separate line.
5454
- To increase the weight of a term such as product names, add several sentences that include the term.
5555
- For common phrases that are used in your content, providing many examples is useful because it tells the system to listen for these terms. 
56-
- Avoid including uncommon symbols (~, # @ % &) as they'll get discarded. The sentences in which they appear will also get discarded.
57-
- Avoid putting too large inputs, such as hundreds of thousands of sentences, because doing so will dilute the effect of boosting.
56+
- Avoid including uncommon symbols (~, # @ % &) as get discarded. The sentences in which they appear also get discarded.
57+
- Avoid putting too large inputs, such as hundreds of thousands of sentences, because doing so dilutes the effect of boosting.
5858

5959
Use this table to ensure that your plain text dataset file is formatted correctly:
6060

0 commit comments

Comments
 (0)