Skip to content

Commit ace9ba1

Browse files
committed
save progress
1 parent e307115 commit ace9ba1

File tree

5 files changed

+7
-4
lines changed

5 files changed

+7
-4
lines changed

articles/cognitive-services/Speech-Service/batch-transcription-audio-data.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -28,7 +28,7 @@ The batch transcription API supports the following formats:
2828
| MP3 | PCM | 16-bit | 8 kHz or 16 kHz, mono or stereo |
2929
| OGG | OPUS | 16-bit | 8 kHz or 16 kHz, mono or stereo |
3030

31-
For stereo audio streams, the left and right channels are split during the transcription. A JSON result file is created for each channel. To create an ordered final transcript, use the timestamps that are generated per utterance.
31+
For stereo audio streams, the left and right channels are split during the transcription. A JSON result file is created for each input audio file. To create an ordered final transcript, use the timestamps that are generated per utterance.
3232

3333
## Azure Storage for audio files
3434

articles/cognitive-services/Speech-Service/batch-transcription-get.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -340,7 +340,7 @@ Depending in part on the request parameters set when you created the transcripti
340340

341341
|Property|Description|
342342
|--------|-----------|
343-
|`channel`|The channel number of the results. For stereo audio streams, the left and right channels are split during the transcription. A JSON result file is created for each channel.|
343+
|`channel`|The channel number of the results. For stereo audio streams, the left and right channels are split during the transcription. A JSON result file is created for each input audio file.|
344344
|`combinedRecognizedPhrases`|The concatenated results of all phrases for the channel.|
345345
|`confidence`|The confidence value for the recognition.|
346346
|`display`|The display form of the recognized text. Added punctuation and capitalization are included.|

articles/cognitive-services/Speech-Service/migrate-v3-0-to-v3-1.md

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -105,13 +105,15 @@ The name of each `operationId` in version 3.1 is prefixed with the object name.
105105
|`/webhooks/{id}`|PATCH|[WebHooks_Update](https://eastus.dev.cognitive.microsoft.com/docs/services/speech-to-text-api-v3-1/operations/WebHooks_Update)|[UpdateHook](https://eastus.dev.cognitive.microsoft.com/docs/services/speech-to-text-api-v3-0/operations/UpdateHook)|
106106

107107
<sup>1</sup> The `/models/{id}/copyto` operation (includes '/') in version 3.0 is replaced by the `/models/{id}:copyto` operation (includes ':') in version 3.1.
108+
108109
<sup>2</sup> The `/webhooks/{id}/ping` operation (includes '/') in version 3.0 is replaced by the `/webhooks/{id}:ping` operation (includes ':') in version 3.1.
110+
109111
<sup>3</sup> The `/webhooks/{id}/test` operation (includes '/') in version 3.0 is replaced by the `/webhooks/{id}:test` operation (includes ':') in version 3.1.
110112

111113
## Batch transcription
112114

113115
> [!WARNING]
114-
> Once you migrate, don't use Speech-to-text REST API v3.0 to retrieve a transcription created via Speech-to-text REST API v3.1. You'll see an error message such as the following: "The API version cannot be used to access this transcription. Please use API version v3.1 or higher."
116+
> Don't use Speech-to-text REST API v3.0 to retrieve a transcription created via Speech-to-text REST API v3.1. You'll see an error message such as the following: "The API version cannot be used to access this transcription. Please use API version v3.1 or higher."
115117
116118
In the [Transcriptions_Create](https://eastus.dev.cognitive.microsoft.com/docs/services/speech-to-text-api-v3-1-preview1/operations/Transcriptions_Create) operation the following three properties are added:
117119
- The `displayFormWordLevelTimestampsEnabled` property can be used to enable the reporting of word-level timestamps on the display form of the transcription results. The results are returned in the `displayPhraseElements` property of the transcription file.

articles/cognitive-services/Speech-Service/rest-speech-to-text.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -187,6 +187,7 @@ This table includes all the web hook operations that are available with the spee
187187
|`/webhooks/{id}`|PATCH|[WebHooks_Update](https://eastus.dev.cognitive.microsoft.com/docs/services/speech-to-text-api-v3-1/operations/WebHooks_Update)|[UpdateHook](https://eastus.dev.cognitive.microsoft.com/docs/services/speech-to-text-api-v3-0/operations/UpdateHook)|
188188

189189
<sup>1</sup> The `/webhooks/{id}/ping` operation (includes '/') in version 3.0 is replaced by the `/webhooks/{id}:ping` operation (includes ':') in version 3.1.
190+
190191
<sup>2</sup> The `/webhooks/{id}/test` operation (includes '/') in version 3.0 is replaced by the `/webhooks/{id}:test` operation (includes ':') in version 3.1.
191192

192193
## Next steps

articles/cognitive-services/Speech-Service/toc.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -372,7 +372,7 @@ items:
372372
items:
373373
- name: Neural voice migration
374374
items:
375-
- name: Migration to neural voice
375+
- name: Migration overview
376376
href: migration-overview-neural-voice.md
377377
- name: Migrate to custom neural voice
378378
href: how-to-migrate-to-custom-neural-voice.md

0 commit comments

Comments
 (0)