Skip to content

Commit 1ef310e

Browse files
committed
acrolinx
1 parent e7fdd56 commit 1ef310e

File tree

1 file changed

+5
-6
lines changed

1 file changed

+5
-6
lines changed

articles/cognitive-services/Speech-Service/display-text-format.md

Lines changed: 5 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -19,7 +19,7 @@ Speech-to-text offers an array of formatting features to ensure that the transcr
1919

2020
## ITN
2121

22-
Inverse Text Normalization (ITN) is a process that converts spoken words into their written form. For example, the spoken word "four" is converted to the written form "4". This process is performed by the speech-to-text service and isn't configurable. Some of the supported text formats include dates, times, decimals, currencies, addresses, emails, and phone numbers. This allows Speech users to speak naturally into their device, and the service formats text as expected. The following table shows the ITN rules that are applied to the text output.
22+
Inverse Text Normalization (ITN) is a process that converts spoken words into their written form. For example, the spoken word "four" is converted to the written form "4". This process is performed by the speech-to-text service and isn't configurable. Some of the supported text formats include dates, times, decimals, currencies, addresses, emails, and phone numbers. You can speak naturally, and the service formats text as expected. The following table shows the ITN rules that are applied to the text output.
2323

2424
|Recognized speech|Display text|
2525
|---|---|
@@ -36,14 +36,13 @@ Speech-to-text models recognize words that should be capitalized to improve read
3636

3737
|Recognized speech|Display text|
3838
|---|---|
39-
|i got an iphone s e|I got an iPhone SE.|
39+
|i got an x l t shirt|I got an XL t-shirt.|
4040
|my name is jennifer smith|My name is Jennifer Smith.|
4141
|i want to visit new york city|I want to visit New York City.|
42-
|i need to service my toyota|I need to service my Toyota.|
4342

4443
## Disfluency removal
4544

46-
When speaking, it is common for someone to stutter, duplicate words, and say filler words like "uhm" or "uh". Speech-to-text can recognize such disfluencies and remove them from the display text. This is great for transcribing live unscripted speeches to read them back later. Some examples are shown in this table.
45+
When speaking, it's common for someone to stutter, duplicate words, and say filler words like "uhm" or "uh". Speech-to-text can recognize such disfluencies and remove them from the display text. Disfluency removal is great for transcribing live unscripted speeches to read them back later. Some examples are shown in this table.
4746

4847
|Recognized speech|Display text|
4948
|---|---|
@@ -53,7 +52,7 @@ When speaking, it is common for someone to stutter, duplicate words, and say fil
5352

5453
## Punctuation
5554

56-
Speech-to-text automatically punctuates your text to improve clarity. This is helpful for reading back call or conversation transcriptions. Some examples are shown in this table.
55+
Speech-to-text automatically punctuates your text to improve clarity. Punctuation is helpful for reading back call or conversation transcriptions. Some examples are shown in this table.
5756

5857
|Recognized speech|Display text|
5958
|---|---|
@@ -120,7 +119,7 @@ You can specify whether to mask, remove, or show profanity in the final transcri
120119
> Microsoft also reserves the right to mask or remove any word that is deemed inappropriate. Such words will not be returned by the Speech service, whether or not you enabled profanity filtering.
121120
122121
The profanity filter options are:
123-
- `Masked`: Replaces letters in profane words with asterisk (*) characters. This is the default option.
122+
- `Masked`: Replaces letters in profane words with asterisk (*) characters. Masked is the default option.
124123
- `Raw`: Include the profane words verbatim.
125124
- `Removed`: Removes profane words.
126125

0 commit comments

Comments
 (0)