You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/azure-video-indexer/customize-speech-model-overview.md
+3-3Lines changed: 3 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -13,7 +13,7 @@ author: IngridAtMicrosoft
13
13
14
14
Through Azure AI Video Indexer integration with [Azure AI Speech services](../ai-services/speech-service/captioning-concepts.md), a Universal Language Model is utilized as a base model that is trained with Microsoft-owned data and reflects commonly used spoken language. The base model is pretrained with dialects and phonetics representing various common domains. The base model works well in most speech recognition scenarios.
15
15
16
-
However, sometimes the base model’s transcription doesn't accurately handle some content. In these situations, a customized speech model can be used to improve recognition of domain-specific vocabulary or pronunciation that is specific to your content by providing text data to train the model. Through the process of creating and adapting speech customization models, your content can be properly transcribed. There is no additional charge for using Video Indexers speech customization.
16
+
However, sometimes the base model’s transcription doesn't accurately handle some content. In these situations, a customized speech model can be used to improve recognition of domain-specific vocabulary or pronunciation that is specific to your content by providing text data to train the model. Through the process of creating and adapting speech customization models, your content can be properly transcribed. There's no additional charge for using Video Indexers speech customization.
17
17
18
18
## When to use a customized speech model?
19
19
@@ -53,8 +53,8 @@ A dataset including plain text sentences of related text can be used to improve
53
53
- Try to have each sentence or keyword on a separate line.
54
54
- To increase the weight of a term such as product names, add several sentences that include the term.
55
55
- For common phrases that are used in your content, providing many examples is useful because it tells the system to listen for these terms.
56
-
- Avoid including uncommon symbols (~, # @ % &) as they'll get discarded. The sentences in which they appear will also get discarded.
57
-
- Avoid putting too large inputs, such as hundreds of thousands of sentences, because doing so will dilute the effect of boosting.
56
+
- Avoid including uncommon symbols (~, # @ % &) as get discarded. The sentences in which they appear also get discarded.
57
+
- Avoid putting too large inputs, such as hundreds of thousands of sentences, because doing so dilutes the effect of boosting.
58
58
59
59
Use this table to ensure that your plain text dataset file is formatted correctly:
0 commit comments