You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/cognitive-services/Speech-Service/batch-transcription.md
+40-7Lines changed: 40 additions & 7 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -124,7 +124,7 @@ Use these optional properties to configure transcription:
124
124
`AddSentiment`
125
125
:::column-end:::
126
126
:::column span="2":::
127
-
Specifies if sentiment analysis should be applied to the utterance. Accepted values are `true` to enable and `false` (the default value) to disable it.
127
+
Specifies if sentiment analysis should be applied to the utterance. Accepted values are `true` to enable and `false` (the default value) to disable it. See [Sentiment Analysis](#sentiment-analysis) for more detail.
128
128
:::row-end:::
129
129
:::row:::
130
130
:::column span="1":::
@@ -213,12 +213,41 @@ For mono input audio, one transcription result file is being created. For stereo
|`ITN`| Inverse-text-normalized form of the recognized text. Abbreviations ("doctor smith" to "dr smith"), phone numbers, and other transformations are applied. |
220
-
|`MaskedITN`| The ITN form with profanity masking applied. |
221
-
|`Display`| The display form of the recognized text. Added punctuation and capitalization are included. |
216
+
:::row:::
217
+
:::column span="1":::
218
+
**Form**
219
+
:::column-end:::
220
+
:::column span="2":::
221
+
**Content**
222
+
:::row-end:::
223
+
:::row:::
224
+
:::column span="1":::
225
+
`Lexical`
226
+
:::column-end:::
227
+
:::column span="2":::
228
+
The actual words recognized.
229
+
:::row-end:::
230
+
:::row:::
231
+
:::column span="1":::
232
+
`ITN`
233
+
:::column-end:::
234
+
:::column span="2":::
235
+
Inverse-text-normalized form of the recognized text. Abbreviations ("doctor smith" to "dr smith"), phone numbers, and other transformations are applied.
236
+
:::row-end:::
237
+
:::row:::
238
+
:::column span="1":::
239
+
`MaskedITN`
240
+
:::column-end:::
241
+
:::column span="2":::
242
+
The ITN form with profanity masking applied.
243
+
:::row-end:::
244
+
:::row:::
245
+
:::column span="1":::
246
+
`Display`
247
+
:::column-end:::
248
+
:::column span="2":::
249
+
The display form of the recognized text. Added punctuation and capitalization are included.
250
+
:::row-end:::
222
251
223
252
## Speaker separation (Diarization)
224
253
@@ -256,6 +285,10 @@ The sentiment feature estimates the sentiment expressed in the audio. The sentim
256
285
257
286
Sentiment is scored per audio segment based on the lexical form. The entire text within that audio segment is used to calculate sentiment. No aggregate sentiment is being calculated for the entire transcription. Currently sentiment analysis is only available for the english language.
258
287
288
+
> [!NOTE]
289
+
> We recommend using the Microsoft Text Analytics API instead. It offers more advanced features beyond sentiment analysis like key phrase extraction, automatic language detection, and more. You can find information and samples in the [Text Analytics documentation](https://azure.microsoft.com/services/cognitive-services/text-analytics/).
0 commit comments