Skip to content

Commit e5ca757

Browse files
authored
Merge pull request #253300 from eric-urban/eur/stt-32-updates
stt 3.2 preview updates
2 parents 72ba25a + 79d7bdd commit e5ca757

File tree

5 files changed

+14
-133
lines changed

5 files changed

+14
-133
lines changed

articles/ai-services/cognitive-services-and-machine-learning.md

Lines changed: 1 addition & 13 deletions
Original file line numberDiff line numberDiff line change
@@ -14,15 +14,7 @@ ms.date: 10/28/2021
1414

1515
Azure AI services provides machine learning capabilities to solve general problems such as analyzing text for emotional sentiment or analyzing images to recognize objects or faces. You don't need special machine learning or data science knowledge to use these services.
1616

17-
[Azure AI services](./what-are-ai-services.md) is a group of services, each supporting different, generalized prediction capabilities. The services are divided into different categories to help you find the right service.
18-
19-
|Service category|Purpose|
20-
|--|--|
21-
|[Decision](https://azure.microsoft.com/services/cognitive-services/directory/decision/)|Build apps that surface recommendations for informed and efficient decision-making.|
22-
|[Language](https://azure.microsoft.com/services/cognitive-services/directory/lang/)|Allow your apps to process natural language with pre-built scripts, evaluate sentiment and learn how to recognize what users want.|
23-
|[Search](https://azure.microsoft.com/services/cognitive-services/directory/search/)|Add Bing Search APIs to your apps and harness the ability to comb billions of webpages, images, videos, and news with a single API call.|
24-
|[Speech](https://azure.microsoft.com/services/cognitive-services/directory/speech/)|Convert speech into text and text into natural-sounding speech. Translate from one language to another and enable speaker verification and recognition.|
25-
|[Vision](https://azure.microsoft.com/services/cognitive-services/directory/vision/)|Recognize, identify, caption, index, and moderate your pictures, videos, and digital ink content.|
17+
[Azure AI services](./what-are-ai-services.md) is a group of services, each supporting different, generalized prediction capabilities.
2618

2719
Use Azure AI services when you:
2820

@@ -106,10 +98,6 @@ The following data categorizes each service by which kind of data it allows or r
10698

10799
The services are used in any application that can make REST API(s) or SDK calls. Examples of applications include web sites, bots, virtual or mixed reality, desktop and mobile applications.
108100

109-
## How is Azure Cognitive Search related to Azure AI services?
110-
111-
[Azure Cognitive Search](../search/search-what-is-azure-search.md) is a separate cloud search service that optionally uses Azure AI services to add image and natural language processing to indexing workloads. Azure AI services is exposed in Azure Cognitive Search through [built-in skills](../search/cognitive-search-predefined-skills.md) that wrap individual APIs. You can use a free resource for walkthroughs, but plan on creating and attaching a [billable resource](../search/cognitive-search-attach-cognitive-services.md) for larger volumes.
112-
113101
## How can you use Azure AI services?
114102

115103
Each service provides information about your data. You can combine services together to chain solutions such as converting speech (audio) to text, translating the text into many languages, then using the translated languages to get answers from a knowledge base. While Azure AI services can be used to create intelligent solutions on their own, they can also be combined with traditional machine learning projects to supplement models or accelerate the development process.

articles/ai-services/speech-service/batch-transcription-create.md

Lines changed: 7 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -254,14 +254,20 @@ To use a Whisper model for batch transcription, you also need to set the `model`
254254
Whisper models via batch transcription are supported in the East US, Southeast Asia, and West Europe regions.
255255

256256
::: zone pivot="rest-api"
257-
You can make a [Models_ListBaseModels](https://github.com/Azure/azure-rest-api-specs/tree/main/specification/cognitiveservices/data-plane/Speech/SpeechToText/preview/v3.2-preview.1) request to get available base models for all locales.
257+
You can make a [Models_ListBaseModels](https://westus.dev.cognitive.microsoft.com/docs/services/speech-to-text-api-v3-2-preview1/operations/Models_ListBaseModels) request to get available base models for all locales.
258258

259259
Make an HTTP GET request as shown in the following example for the `eastus` region. Replace `YourSubscriptionKey` with your Speech resource key. Replace `eastus` if you're using a different region.
260260

261261
```azurecli-interactive
262262
curl -v -X GET "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2-preview.1/models/base" -H "Ocp-Apim-Subscription-Key: YourSubscriptionKey"
263263
```
264264

265+
By default only the 100 oldest base models are returned, so you can use the `skip` and `top` query parameters to page through the results. For example, the following request returns the next 100 base models after the first 100.
266+
267+
```azurecli-interactive
268+
curl -v -X GET "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2-preview.1/models/base?skip=100&top=100" -H "Ocp-Apim-Subscription-Key: YourSubscriptionKey"
269+
``````
270+
265271
::: zone-end
266272
267273
::: zone pivot="speech-cli"

articles/ai-services/speech-service/includes/release-notes/release-notes-stt.md

Lines changed: 5 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2,7 +2,7 @@
22
author: eric-urban
33
ms.service: cognitive-services
44
ms.topic: include
5-
ms.date: 09/15/2023
5+
ms.date: 09/29/2023
66
ms.author: eur
77
---
88

@@ -17,6 +17,10 @@ Azure AI Speech now supports OpenAI's Whisper model via the batch transcription
1717
1818
Check out [What is the Whisper model?](../../whisper-overview.md) to learn more about when to use Azure AI Speech vs. Azure OpenAI Service.
1919

20+
#### Speech to text REST API v3.2 public preview
21+
22+
Speech to text REST API v3.2 is available in preview. [Speech to text REST API](../../rest-speech-to-text.md) v3.1 is generally available. Speech to text REST API v3.0 will be retired on April 1st, 2026. For more information, see the Speech to text REST API [v3.0 to v3.1](../../migrate-v3-0-to-v3-1.md) and [v3.1 to v3.2](../../migrate-v3-1-to-v3-2.md) migration guides.
23+
2024
### August 2023 release
2125

2226
#### New Speech to text-locales:

articles/ai-services/speech-service/migrate-v3-1-to-v3-2.md

Lines changed: 0 additions & 116 deletions
Original file line numberDiff line numberDiff line change
@@ -110,122 +110,6 @@ Added token count and token error properties to the `EvaluationProperties` prope
110110
- `tokenInsertionCount2`: The number of recognized tokens by model2 that are insertions.
111111
- `tokenSubstitutionCount2`: The number of recognized words by model2 that are substitutions.
112112

113-
### Model copy
114-
115-
Added the new `"/operations/models/copy/{id}"` operation. Used for copy models scenario.
116-
117-
Added the new `"/models/{id}:copy"` operation. Schema in the new copy operation: `"$ref": "#/definitions/ModelCopyAuthorization"` Deprecated the `"/models/{id}:copyto"` operation. Schema in the deprecated copy operation: `"$ref": "#/definitions/ModelCopy"`
118-
119-
Added the new `"/models:authorizecopy"` operation returns `"$ref": "#/definitions/ModelCopyAuthorization"`. This returned entity can be used in the new `"/models/{id}:copy"` operation.
120-
121-
New entity definitions related to model copy authorization:
122-
- `ModelCopyAuthorization`
123-
- `ModelCopyAuthorizationDefinition`: The Azure Resource ID of the source speech resource.
124-
125-
```json
126-
"ModelCopyAuthorization": {
127-
"title": "ModelCopyAuthorization",
128-
"required": [
129-
"expirationDateTime",
130-
"id",
131-
"sourceResourceId",
132-
"targetResourceEndpoint",
133-
"targetResourceId",
134-
"targetResourceRegion"
135-
],
136-
"type": "object",
137-
"properties": {
138-
"targetResourceRegion": {
139-
"description": "The region (aka location) of the target speech resource (e.g., westus2).",
140-
"minLength": 1,
141-
"type": "string"
142-
},
143-
"targetResourceId": {
144-
"description": "The Azure Resource ID of the target speech resource.",
145-
"minLength": 1,
146-
"type": "string"
147-
},
148-
"targetResourceEndpoint": {
149-
"description": "The endpoint (base url) of the target resource (with custom domain name when it is used).",
150-
"minLength": 1,
151-
"type": "string"
152-
},
153-
"sourceResourceId": {
154-
"description": "The Azure Resource ID of the source speech resource.",
155-
"minLength": 1,
156-
"type": "string"
157-
},
158-
"expirationDateTime": {
159-
"format": "date-time",
160-
"description": "The expiration date of this copy authorization.",
161-
"type": "string"
162-
},
163-
"id": {
164-
"description": "The ID of this copy authorization.",
165-
"minLength": 1,
166-
"type": "string"
167-
}
168-
}
169-
},
170-
```
171-
172-
```json
173-
"ModelCopyAuthorizationDefinition": {
174-
"title": "ModelCopyAuthorizationDefinition",
175-
"required": [
176-
"sourceResourceId"
177-
],
178-
"type": "object",
179-
"properties": {
180-
"sourceResourceId": {
181-
"description": "The Azure Resource ID of the source speech resource.",
182-
"minLength": 1,
183-
"type": "string"
184-
}
185-
}
186-
},
187-
```
188-
189-
### CustomModelLinks copy properties
190-
191-
New `copy` property
192-
copyTo URI: The location to the obsolete model copy action. See operation \"Models_CopyTo\" for more details.
193-
copy URI: The location to the model copy action. See operation \"Models_Copy\" for more details.
194-
195-
```json
196-
"CustomModelLinks": {
197-
"title": "CustomModelLinks",
198-
"type": "object",
199-
"properties": {
200-
"copyTo": {
201-
"format": "uri",
202-
"description": "The location to the obsolete model copy action. See operation \"Models_CopyTo\" for more details.",
203-
"type": "string",
204-
"readOnly": true
205-
},
206-
"copy": {
207-
"format": "uri",
208-
"description": "The location to the model copy action. See operation \"Models_Copy\" for more details.",
209-
"type": "string",
210-
"readOnly": true
211-
},
212-
"files": {
213-
"format": "uri",
214-
"description": "The location to get all files of this entity. See operation \"Models_ListFiles\" for more details.",
215-
"type": "string",
216-
"readOnly": true
217-
},
218-
"manifest": {
219-
"format": "uri",
220-
"description": "The location to get a manifest for this model to be used in the on-prem container. See operation \"Models_GetCustomModelManifest\" for more details.",
221-
"type": "string",
222-
"readOnly": true
223-
}
224-
},
225-
"readOnly": true
226-
},
227-
```
228-
229113
## Operation IDs
230114

231115
You must update the base path in your code from `/speechtotext/v3.1` to `/speechtotext/v3.2-preview.1`. For example, to get base models in the `eastus` region, use `https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2-preview.1/models/base` instead of `https://eastus.api.cognitive.microsoft.com/speechtotext/v3.1/models/base`.

articles/ai-services/speech-service/releasenotes.md

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -21,11 +21,10 @@ Azure AI Speech is updated on an ongoing basis. To stay up-to-date with recent d
2121
2222
## Recent highlights
2323

24+
* Azure AI Speech now supports OpenAI's Whisper model via the batch transcription API. To learn more, check out the [Create a batch transcription](./batch-transcription-create.md#using-whisper-models) guide.
2425
* [Speech to text REST API version 3.2](./migrate-v3-1-to-v3-2.md) is available in public preview.
2526
* Speech SDK 1.32.1 was released in September 2023.
2627
* [Real-time diarization](./get-started-stt-diarization.md) is in public preview.
27-
* Some Speech Studio [scenarios](speech-studio-overview.md#speech-studio-scenarios) are available to try without an Azure subscription.
28-
* Text to speech [Batch synthesis API](./batch-synthesis.md) is available in public preview.
2928

3029
## Release notes
3130

0 commit comments

Comments
 (0)