
Commit 69d2406

Merge pull request #269943 from eric-urban/eur/stt-32-preview2
stt 3.2 preview2
2 parents 7998480 + 990ad3a commit 69d2406


3 files changed, +129 -14 lines changed


articles/ai-services/speech-service/batch-transcription-create.md

Lines changed: 9 additions & 9 deletions
@@ -260,18 +260,18 @@ To use a Whisper model for batch transcription, you need to set the `model` prop
260260
Whisper models via batch transcription are supported in the East US, Southeast Asia, and West Europe regions.
261261

262262
::: zone pivot="rest-api"
263-
You can make a [Models_ListBaseModels](https://westus.dev.cognitive.microsoft.com/docs/services/speech-to-text-api-v3-2-preview1/operations/Models_ListBaseModels) request to get available base models for all locales.
263+
You can make a [Models_ListBaseModels](https://westus.dev.cognitive.microsoft.com/docs/services/speech-to-text-api-v3-2-preview2/operations/Models_ListBaseModels) request to get available base models for all locales.
264264

265265
Make an HTTP GET request as shown in the following example for the `eastus` region. Replace `YourSubscriptionKey` with your Speech resource key. Replace `eastus` if you're using a different region.
266266

267267
```azurecli-interactive
268-
curl -v -X GET "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2-preview.1/models/base" -H "Ocp-Apim-Subscription-Key: YourSubscriptionKey"
268+
curl -v -X GET "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2-preview.2/models/base" -H "Ocp-Apim-Subscription-Key: YourSubscriptionKey"
269269
```
270270

271271
By default, only the 100 oldest base models are returned. Use the `skip` and `top` query parameters to page through the results. For example, the following request returns the next 100 base models after the first 100.
272272

273273
```azurecli-interactive
274-
curl -v -X GET "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2-preview.1/models/base?skip=100&top=100" -H "Ocp-Apim-Subscription-Key: YourSubscriptionKey"
274+
curl -v -X GET "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2-preview.2/models/base?skip=100&top=100" -H "Ocp-Apim-Subscription-Key: YourSubscriptionKey"
275275
```
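
The `skip` and `top` paging described above can also be scripted. The following Python sketch only builds the request URLs for successive pages and sends nothing. The helper name `base_models_page_urls` and its `pages` parameter are illustrative assumptions, not part of the Speech service API.

```python
from urllib.parse import urlencode

API_VERSION = "v3.2-preview.2"


def base_models_page_urls(region: str, pages: int, page_size: int = 100) -> list[str]:
    """Build one models/base URL per page using the skip and top query parameters."""
    base = f"https://{region}.api.cognitive.microsoft.com/speechtotext/{API_VERSION}/models/base"
    urls = []
    for page in range(pages):
        # skip = number of models already seen; top = size of the page to return.
        query = urlencode({"skip": page * page_size, "top": page_size})
        urls.append(f"{base}?{query}")
    return urls


if __name__ == "__main__":
    for url in base_models_page_urls("eastus", pages=2):
        print(url)
```

Each URL still needs the `Ocp-Apim-Subscription-Key` header when the request is actually sent.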
276276

277277
::: zone-end
@@ -280,7 +280,7 @@ curl -v -X GET "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2-pre
280280
Make sure that you set the [configuration variables](spx-basics.md#create-a-resource-configuration) for a Speech resource in one of the supported regions. You can run the `spx csr list --base` command to get available base models for all locales.
281281

282282
```azurecli
283-
spx csr list --base --api-version v3.2-preview.1
283+
spx csr list --base --api-version v3.2-preview.2
284284
```
285285

286286
::: zone-end
@@ -289,9 +289,9 @@ The `displayName` property of a Whisper model contains "Whisper" as shown in thi
289289

290290
```json
291291
{
292-
"self": "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2-preview.1/models/base/e418c4a9-9937-4db7-b2c9-8afbff72d950",
292+
"self": "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2-preview.2/models/base/e418c4a9-9937-4db7-b2c9-8afbff72d950",
293293
"links": {
294-
"manifest": "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2-preview.1/models/base/e418c4a9-9937-4db7-b2c9-8afbff72d950/manifest"
294+
"manifest": "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2-preview.2/models/base/e418c4a9-9937-4db7-b2c9-8afbff72d950/manifest"
295295
},
296296
"properties": {
297297
"deprecationDates": {
@@ -333,20 +333,20 @@ curl -v -X POST -H "Ocp-Apim-Subscription-Key: YourSubscriptionKey" -H "Content-
333333
"locale": "en-US",
334334
"displayName": "My Transcription",
335335
"model": {
336-
"self": "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2-preview.1/models/base/d9cbeee6-582b-47ad-b5c1-6226583c92b6"
336+
"self": "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2-preview.2/models/base/d9cbeee6-582b-47ad-b5c1-6226583c92b6"
337337
},
338338
"properties": {
339339
"wordLevelTimestampsEnabled": true
340340
}
341-
}' "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2-preview.1/transcriptions"
341+
}' "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2-preview.2/transcriptions"
342342
```
343343

344344
::: zone-end
345345

346346
::: zone pivot="speech-cli"
347347

348348
```azurecli
349-
spx batch transcription create --name "My Transcription" --language "en-US" --content https://crbn.us/hello.wav;https://crbn.us/whatstheweatherlike.wav --model "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2-preview.1/models/base/d9cbeee6-582b-47ad-b5c1-6226583c92b6" --api-version v3.2-preview.1
349+
spx batch transcription create --name "My Transcription" --language "en-US" --content https://crbn.us/hello.wav;https://crbn.us/whatstheweatherlike.wav --model "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2-preview.2/models/base/d9cbeee6-582b-47ad-b5c1-6226583c92b6" --api-version v3.2-preview.2
350350
```
351351

352352
::: zone-end

articles/ai-services/speech-service/migrate-v3-1-to-v3-2.md

Lines changed: 119 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -24,7 +24,7 @@ The Speech to text REST API is used for [Batch transcription](batch-transcriptio
2424

2525
## Base path
2626

27-
You must update the base path in your code from `/speechtotext/v3.1` to `/speechtotext/v3.2-preview.1`. For example, to get base models in the `eastus` region, use `https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2-preview.1/models/base` instead of `https://eastus.api.cognitive.microsoft.com/speechtotext/v3.1/models/base`.
27+
You must update the base path in your code from `/speechtotext/v3.1` to `/speechtotext/v3.2-preview.2`. For example, to get base models in the `eastus` region, use `https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2-preview.2/models/base` instead of `https://eastus.api.cognitive.microsoft.com/speechtotext/v3.1/models/base`.
2828

2929
For more information, see [Operation IDs](#operation-ids) later in this guide.
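
If client code stores full request URLs, the base path change can be applied mechanically. The following is a minimal Python sketch of that rewrite. The function name `migrate_base_path` is a hypothetical helper, and the sketch assumes the only difference between the two URLs is the base path segment.

```python
OLD_BASE_PATH = "/speechtotext/v3.1"
NEW_BASE_PATH = "/speechtotext/v3.2-preview.2"


def migrate_base_path(url: str) -> str:
    """Swap the v3.1 base path for the v3.2-preview.2 base path, if present."""
    head, sep, tail = url.partition(OLD_BASE_PATH)
    # Rewrite only when the old base path is a whole path segment; URLs that
    # do not contain it (or that continue with more version characters) are
    # returned unchanged.
    if not sep or (tail and not tail.startswith("/")):
        return url
    return head + NEW_BASE_PATH + tail


if __name__ == "__main__":
    print(migrate_base_path(
        "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.1/models/base"
    ))
```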
3030

@@ -57,7 +57,7 @@ Azure AI Speech now supports OpenAI's Whisper model via Speech to text REST API
5757
5858
### Custom display text formatting
5959

60-
To support model adaptation with [custom display text formatting](how-to-custom-speech-test-and-train.md#custom-display-text-formatting-data-for-training) data, the [Datasets_Create](https://eastus.dev.cognitive.microsoft.com/docs/services/speech-to-text-api-v3-2-preview1/operations/Datasets_Create) operation supports the **OutputFormatting** data kind. For more information, see [upload datasets](how-to-custom-speech-upload-data.md#upload-datasets).
60+
To support model adaptation with [custom display text formatting](how-to-custom-speech-test-and-train.md#custom-display-text-formatting-data-for-training) data, the [Datasets_Create](https://eastus.dev.cognitive.microsoft.com/docs/services/speech-to-text-api-v3-2-preview2/operations/Datasets_Create) operation supports the **OutputFormatting** data kind. For more information, see [upload datasets](how-to-custom-speech-upload-data.md#upload-datasets).
6161

6262
Added a definition for `OutputFormatType` with `Lexical` and `Display` enum values.
6363

@@ -174,15 +174,130 @@ Added token count and token error properties to the `EvaluationProperties` prope
174174
- `tokenInsertionCount2`: The number of recognized tokens by model2 that are insertions.
175175
- `tokenSubstitutionCount2`: The number of recognized words by model2 that are substitutions.
176176

177+
178+
### Model copy
179+
180+
Added the new `"/operations/models/copy/{id}"` operation, which is used for the model copy scenario.
181+
Added the new `"/models/{id}:copy"` operation, whose schema is `"$ref": "#/definitions/ModelCopyAuthorization"`. Deprecated the `"/models/{id}:copyto"` operation, whose schema is `"$ref": "#/definitions/ModelCopy"`.
182+
Added the new `"/models:authorizecopy"` operation, which returns `"$ref": "#/definitions/ModelCopyAuthorization"`. The returned entity can be used in the new `"/models/{id}:copy"` operation.
183+
184+
Added a new entity definition for `ModelCopyAuthorization`:
185+
186+
```json
187+
"ModelCopyAuthorization": {
188+
"title": "ModelCopyAuthorization",
189+
"required": [
190+
"expirationDateTime",
191+
"id",
192+
"sourceResourceId",
193+
"targetResourceEndpoint",
194+
"targetResourceId",
195+
"targetResourceRegion"
196+
],
197+
"type": "object",
198+
"properties": {
199+
"targetResourceRegion": {
200+
"description": "The region (aka location) of the target speech resource (e.g., westus2).",
201+
"minLength": 1,
202+
"type": "string"
203+
},
204+
"targetResourceId": {
205+
"description": "The Azure Resource ID of the target speech resource.",
206+
"minLength": 1,
207+
"type": "string"
208+
},
209+
"targetResourceEndpoint": {
210+
"description": "The endpoint (base url) of the target resource (with custom domain name when it is used).",
211+
"minLength": 1,
212+
"type": "string"
213+
},
214+
"sourceResourceId": {
215+
"description": "The Azure Resource ID of the source speech resource.",
216+
"minLength": 1,
217+
"type": "string"
218+
},
219+
"expirationDateTime": {
220+
"format": "date-time",
221+
"description": "The expiration date of this copy authorization.",
222+
"type": "string"
223+
},
224+
"id": {
225+
"description": "The ID of this copy authorization.",
226+
"minLength": 1,
227+
"type": "string"
228+
}
229+
}
230+
},
231+
```
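
As an illustration of the `required` list and `minLength` constraints in this definition, the following Python sketch checks a `ModelCopyAuthorization` payload on the client side. The function name `missing_or_invalid_fields` and the sample values are hypothetical. The sketch also treats an empty `expirationDateTime` as invalid, on the assumption that an empty string can't be a valid `date-time`.

```python
REQUIRED_FIELDS = (
    "expirationDateTime",
    "id",
    "sourceResourceId",
    "targetResourceEndpoint",
    "targetResourceId",
    "targetResourceRegion",
)


def missing_or_invalid_fields(authorization: dict) -> list[str]:
    """Return the required fields that are absent, not strings, or empty."""
    problems = []
    for name in REQUIRED_FIELDS:
        value = authorization.get(name)
        if not isinstance(value, str) or len(value) < 1:
            problems.append(name)
    return problems


if __name__ == "__main__":
    # Placeholder values for illustration only; not real identifiers.
    sample = {
        "expirationDateTime": "2030-01-01T00:00:00Z",
        "id": "placeholder-authorization-id",
        "sourceResourceId": "placeholder-source-resource-id",
        "targetResourceEndpoint": "https://westus2.api.cognitive.microsoft.com",
        "targetResourceId": "placeholder-target-resource-id",
        "targetResourceRegion": "westus2",
    }
    print(missing_or_invalid_fields(sample))
```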
232+
233+
Added a new entity definition for `ModelCopyAuthorizationDefinition`:
234+
235+
```json
236+
"ModelCopyAuthorizationDefinition": {
237+
"title": "ModelCopyAuthorizationDefinition",
238+
"required": [
239+
"sourceResourceId"
240+
],
241+
"type": "object",
242+
"properties": {
243+
"sourceResourceId": {
244+
"description": "The Azure Resource ID of the source speech resource.",
245+
"minLength": 1,
246+
"type": "string"
247+
}
248+
}
249+
},
250+
```
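
A payload that satisfies this definition needs only `sourceResourceId`. The following Python sketch serializes one. It assumes, without confirmation from the text above, that this entity is the request body of the `/models:authorizecopy` operation. The function name `authorize_copy_body` and the resource ID value are placeholders.

```python
import json


def authorize_copy_body(source_resource_id: str) -> str:
    """Serialize a ModelCopyAuthorizationDefinition with its one required field."""
    if not source_resource_id:
        # Mirrors the schema's minLength of 1 for sourceResourceId.
        raise ValueError("sourceResourceId must be a non-empty string")
    return json.dumps({"sourceResourceId": source_resource_id})


if __name__ == "__main__":
    # Placeholder Azure resource ID; substitute the ID of your own Speech resource.
    print(authorize_copy_body("placeholder-source-resource-id"))
```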
251+
252+
### CustomModelLinks copy properties
253+
254+
Added a new `copy` property.
255+
- `copyTo` URI: The location to the obsolete model copy action. See operation "Models_CopyTo" for more details.
256+
- `copy` URI: The location to the model copy action. See operation "Models_Copy" for more details.
257+
258+
```json
259+
"CustomModelLinks": {
260+
"title": "CustomModelLinks",
261+
"type": "object",
262+
"properties": {
263+
"copyTo": {
264+
"format": "uri",
265+
"description": "The location to the obsolete model copy action. See operation \"Models_CopyTo\" for more details.",
266+
"type": "string",
267+
"readOnly": true
268+
},
269+
"copy": {
270+
"format": "uri",
271+
"description": "The location to the model copy action. See operation \"Models_Copy\" for more details.",
272+
"type": "string",
273+
"readOnly": true
274+
},
275+
"files": {
276+
"format": "uri",
277+
"description": "The location to get all files of this entity. See operation \"Models_ListFiles\" for more details.",
278+
"type": "string",
279+
"readOnly": true
280+
},
281+
"manifest": {
282+
"format": "uri",
283+
"description": "The location to get a manifest for this model to be used in the on-prem container. See operation \"Models_GetCustomModelManifest\" for more details.",
284+
"type": "string",
285+
"readOnly": true
286+
}
287+
},
288+
"readOnly": true
289+
},
290+
```
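
Because `copyTo` is described as obsolete, a client that reads `CustomModelLinks` might prefer the `copy` link when both are present. The following Python sketch shows one way to express that preference. The function name `preferred_copy_link` and the example URLs are hypothetical, and the fallback to `copyTo` is a design choice for this sketch rather than documented service behavior.

```python
from typing import Optional


def preferred_copy_link(links: dict) -> Optional[str]:
    """Prefer the new `copy` link and fall back to the obsolete `copyTo` link."""
    return links.get("copy") or links.get("copyTo")


if __name__ == "__main__":
    # Placeholder link values for illustration only.
    links = {
        "copyTo": "https://example.invalid/models/placeholder:copyto",
        "copy": "https://example.invalid/models/placeholder:copy",
    }
    print(preferred_copy_link(links))
```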
291+
177292
## Operation IDs
178293

179-
You must update the base path in your code from `/speechtotext/v3.1` to `/speechtotext/v3.2-preview.1`. For example, to get base models in the `eastus` region, use `https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2-preview.1/models/base` instead of `https://eastus.api.cognitive.microsoft.com/speechtotext/v3.1/models/base`.
294+
You must update the base path in your code from `/speechtotext/v3.1` to `/speechtotext/v3.2-preview.2`. For example, to get base models in the `eastus` region, use `https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2-preview.2/models/base` instead of `https://eastus.api.cognitive.microsoft.com/speechtotext/v3.1/models/base`.
180295

181296

182297
## Next steps
183298

184299
* [Speech to text REST API](rest-speech-to-text.md)
185-
* [Speech to text REST API v3.2 (preview)](https://eastus.dev.cognitive.microsoft.com/docs/services/speech-to-text-api-v3-2-preview1)
300+
* [Speech to text REST API v3.2 (preview)](https://eastus.dev.cognitive.microsoft.com/docs/services/speech-to-text-api-v3-2-preview2)
186301
* [Speech to text REST API v3.1 reference](https://eastus.dev.cognitive.microsoft.com/docs/services/speech-to-text-api-v3-1)
187302
* [Speech to text REST API v3.0 reference](https://eastus.dev.cognitive.microsoft.com/docs/services/speech-to-text-api-v3-0)
188303

articles/ai-services/speech-service/rest-speech-to-text.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -22,7 +22,7 @@ Speech to text REST API is used for [batch transcription](batch-transcription.md
2222
> Speech to text REST API v3.0 will be retired on April 1st, 2026. For more information, see the Speech to text REST API [v3.0 to v3.1](migrate-v3-0-to-v3-1.md) and [v3.1 to v3.2](migrate-v3-1-to-v3-2.md) migration guides.
2323
2424
> [!div class="nextstepaction"]
25-
> [See the Speech to text REST API v3.2 (preview)](https://eastus.dev.cognitive.microsoft.com/docs/services/speech-to-text-api-v3-2-preview1)
25+
> [See the Speech to text REST API v3.2 (preview)](https://eastus.dev.cognitive.microsoft.com/docs/services/speech-to-text-api-v3-2-preview2)
2626
2727
> [!div class="nextstepaction"]
2828
> [See the Speech to text REST API v3.1 reference documentation](https://eastus.dev.cognitive.microsoft.com/docs/services/speech-to-text-api-v3-1/)
