Skip to content

Commit 30c18fb

Browse files
Update batch-transcription.md
1 parent cd22dab commit 30c18fb

File tree

1 file changed

+7
-7
lines changed

1 file changed

+7
-7
lines changed

articles/cognitive-services/Speech-Service/batch-transcription.md

Lines changed: 7 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -99,9 +99,9 @@ For more details, see [Webhooks](webhooks.md).
9999

100100
## Speaker Separation (Diarization)
101101

102-
Diarization is the process of separating speakers in a piece of audio. Our Batch pipeline supports Diarization and is capable of recognizing 2 speakers on mono channel recordings.
102+
Diarization is the process of separating speakers in a piece of audio. Our Batch pipeline supports Diarization and is capable of recognizing two speakers on mono channel recordings.
103103

104-
To request that your audio transcription request is processed for diarization you simply have to add the relevant parameter in the HTTP request as shown below.
104+
To request that your audio transcription request is processed for diarization, you simply have to add the relevant parameter in the HTTP request as shown below.
105105

106106
```json
107107
{
@@ -117,11 +117,11 @@ To request that your audio transcription request is processed for diarization yo
117117
}
118118
```
119119

120-
Note that world level timestamps would also have to be 'turned on' as the parameters in the above request indicate.
120+
Word level timestamps would also have to be 'turned on' as the parameters in the above request indicate.
121121

122-
The corresponding audio will contain the speakers identified by a number (currently we support only 2 voices, so the speakers will be identified as 'Speaker 1 'and 'Speaker 2') followed by the transcription output.
122+
The corresponding audio will contain the speakers identified by a number (currently we support only two voices, so the speakers will be identified as 'Speaker 1 'and 'Speaker 2') followed by the transcription output.
123123

124-
Also note that Diarization is not available in Stereo recordings. Furthermore, all JSON output will contain the Speaker tag. If diarization is not used it will simply show as 'Speaker: Null'
124+
Also note that Diarization is not available in Stereo recordings. Furthermore, all JSON output will contain the Speaker tag. If diarization is not used, it will show 'Speaker: Null' inteh JSON output.
125125

126126
Supported locales are listed below.
127127

@@ -141,7 +141,7 @@ Sentiment is a new feature in Batch Transcription API and is an important featur
141141
4. Pinpoint what went well when turning negative calls to positive
142142
5. Identify what customers like and what they dislike about a product or a service
143143

144-
Sentiment is scored per audio segment where an audio segment is defined as the time lapse between the start of the utterance (offset) and the detection silence of end of byte stream. The entire text within that segment is used to calculate sentiment. We DO NOT calculate any aggregate sentiment values for the entire call or the entire speech of each channel. These are left to the domain owner to further apply.
144+
Sentiment is scored per audio segment where an audio segment is defined as the time lapse between the start of the utterance (offset) and the detection silence of end of byte stream. The entire text within that segment is used to calculate sentiment. We DO NOT calculate any aggregate sentiment values for the entire call or the entire speech of each channel. These aggregations are left to the domain owner to further apply.
145145

146146
Sentiment is applied on the lexical form.
147147

@@ -180,7 +180,7 @@ A JSON output sample looks like below:
180180
]
181181
}
182182
```
183-
The features uses a Sentiment model which is currently in Beta.
183+
The feature uses a Sentiment model, which is currently in Beta.
184184

185185
## Sample code
186186

0 commit comments

Comments
 (0)