You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/ai-services/cognitive-services-and-machine-learning.md
+1-13Lines changed: 1 addition & 13 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -14,15 +14,7 @@ ms.date: 10/28/2021
14
14
15
15
Azure AI services provides machine learning capabilities to solve general problems such as analyzing text for emotional sentiment or analyzing images to recognize objects or faces. You don't need special machine learning or data science knowledge to use these services.
16
16
17
-
[Azure AI services](./what-are-ai-services.md) is a group of services, each supporting different, generalized prediction capabilities. The services are divided into different categories to help you find the right service.
18
-
19
-
|Service category|Purpose|
20
-
|--|--|
21
-
|[Decision](https://azure.microsoft.com/services/cognitive-services/directory/decision/)|Build apps that surface recommendations for informed and efficient decision-making.|
22
-
|[Language](https://azure.microsoft.com/services/cognitive-services/directory/lang/)|Allow your apps to process natural language with pre-built scripts, evaluate sentiment and learn how to recognize what users want.|
23
-
|[Search](https://azure.microsoft.com/services/cognitive-services/directory/search/)|Add Bing Search APIs to your apps and harness the ability to comb billions of webpages, images, videos, and news with a single API call.|
24
-
|[Speech](https://azure.microsoft.com/services/cognitive-services/directory/speech/)|Convert speech into text and text into natural-sounding speech. Translate from one language to another and enable speaker verification and recognition.|
25
-
|[Vision](https://azure.microsoft.com/services/cognitive-services/directory/vision/)|Recognize, identify, caption, index, and moderate your pictures, videos, and digital ink content.|
17
+
[Azure AI services](./what-are-ai-services.md) is a group of services, each supporting different, generalized prediction capabilities.
26
18
27
19
Use Azure AI services when you:
28
20
@@ -106,10 +98,6 @@ The following data categorizes each service by which kind of data it allows or r
106
98
107
99
The services are used in any application that can make REST API(s) or SDK calls. Examples of applications include web sites, bots, virtual or mixed reality, desktop and mobile applications.
108
100
109
-
## How is Azure Cognitive Search related to Azure AI services?
110
-
111
-
[Azure Cognitive Search](../search/search-what-is-azure-search.md) is a separate cloud search service that optionally uses Azure AI services to add image and natural language processing to indexing workloads. Azure AI services is exposed in Azure Cognitive Search through [built-in skills](../search/cognitive-search-predefined-skills.md) that wrap individual APIs. You can use a free resource for walkthroughs, but plan on creating and attaching a [billable resource](../search/cognitive-search-attach-cognitive-services.md) for larger volumes.
112
-
113
101
## How can you use Azure AI services?
114
102
115
103
Each service provides information about your data. You can combine services together to chain solutions such as converting speech (audio) to text, translating the text into many languages, then using the translated languages to get answers from a knowledge base. While Azure AI services can be used to create intelligent solutions on their own, they can also be combined with traditional machine learning projects to supplement models or accelerate the development process.
Copy file name to clipboardExpand all lines: articles/ai-services/speech-service/batch-transcription-create.md
+7-1Lines changed: 7 additions & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -254,14 +254,20 @@ To use a Whisper model for batch transcription, you also need to set the `model`
254
254
Whisper models via batch transcription are supported in the East US, Southeast Asia, and West Europe regions.
255
255
256
256
::: zone pivot="rest-api"
257
-
You can make a [Models_ListBaseModels](https://github.com/Azure/azure-rest-api-specs/tree/main/specification/cognitiveservices/data-plane/Speech/SpeechToText/preview/v3.2-preview.1) request to get available base models for all locales.
257
+
You can make a [Models_ListBaseModels](https://westus.dev.cognitive.microsoft.com/docs/services/speech-to-text-api-v3-2-preview1/operations/Models_ListBaseModels) request to get available base models for all locales.
258
258
259
259
Make an HTTP GET request as shown in the following example for the `eastus` region. Replace `YourSubscriptionKey` with your Speech resource key. Replace `eastus` if you're using a different region.
260
260
261
261
```azurecli-interactive
262
262
curl -v -X GET "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2-preview.1/models/base" -H "Ocp-Apim-Subscription-Key: YourSubscriptionKey"
263
263
```
264
264
265
+
By default only the 100 oldest base models are returned, so you can use the `skip` and `top` query parameters to page through the results. For example, the following request returns the next 100 base models after the first 100.
266
+
267
+
```azurecli-interactive
268
+
curl -v -X GET "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2-preview.1/models/base?skip=100&top=100" -H "Ocp-Apim-Subscription-Key: YourSubscriptionKey"
Copy file name to clipboardExpand all lines: articles/ai-services/speech-service/includes/release-notes/release-notes-stt.md
+5-1Lines changed: 5 additions & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -2,7 +2,7 @@
2
2
author: eric-urban
3
3
ms.service: cognitive-services
4
4
ms.topic: include
5
-
ms.date: 09/15/2023
5
+
ms.date: 09/29/2023
6
6
ms.author: eur
7
7
---
8
8
@@ -17,6 +17,10 @@ Azure AI Speech now supports OpenAI's Whisper model via the batch transcription
17
17
18
18
Check out [What is the Whisper model?](../../whisper-overview.md) to learn more about when to use Azure AI Speech vs. Azure OpenAI Service.
19
19
20
+
#### Speech to text REST API v3.2 public preview
21
+
22
+
Speech to text REST API v3.2 is available in preview. [Speech to text REST API](../../rest-speech-to-text.md) v3.1 is generally available. Speech to text REST API v3.0 will be retired on April 1st, 2026. For more information, see the Speech to text REST API [v3.0 to v3.1](../../migrate-v3-0-to-v3-1.md) and [v3.1 to v3.2](../../migrate-v3-1-to-v3-2.md) migration guides.
Copy file name to clipboardExpand all lines: articles/ai-services/speech-service/migrate-v3-1-to-v3-2.md
-116Lines changed: 0 additions & 116 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -110,122 +110,6 @@ Added token count and token error properties to the `EvaluationProperties` prope
110
110
-`tokenInsertionCount2`: The number of recognized tokens by model2 that are insertions.
111
111
-`tokenSubstitutionCount2`: The number of recognized words by model2 that are substitutions.
112
112
113
-
### Model copy
114
-
115
-
Added the new `"/operations/models/copy/{id}"` operation. Used for copy models scenario.
116
-
117
-
Added the new `"/models/{id}:copy"` operation. Schema in the new copy operation: `"$ref": "#/definitions/ModelCopyAuthorization"` Deprecated the `"/models/{id}:copyto"` operation. Schema in the deprecated copy operation: `"$ref": "#/definitions/ModelCopy"`
118
-
119
-
Added the new `"/models:authorizecopy"` operation returns `"$ref": "#/definitions/ModelCopyAuthorization"`. This returned entity can be used in the new `"/models/{id}:copy"` operation.
120
-
121
-
New entity definitions related to model copy authorization:
122
-
-`ModelCopyAuthorization`
123
-
-`ModelCopyAuthorizationDefinition`: The Azure Resource ID of the source speech resource.
124
-
125
-
```json
126
-
"ModelCopyAuthorization": {
127
-
"title": "ModelCopyAuthorization",
128
-
"required": [
129
-
"expirationDateTime",
130
-
"id",
131
-
"sourceResourceId",
132
-
"targetResourceEndpoint",
133
-
"targetResourceId",
134
-
"targetResourceRegion"
135
-
],
136
-
"type": "object",
137
-
"properties": {
138
-
"targetResourceRegion": {
139
-
"description": "The region (aka location) of the target speech resource (e.g., westus2).",
140
-
"minLength": 1,
141
-
"type": "string"
142
-
},
143
-
"targetResourceId": {
144
-
"description": "The Azure Resource ID of the target speech resource.",
145
-
"minLength": 1,
146
-
"type": "string"
147
-
},
148
-
"targetResourceEndpoint": {
149
-
"description": "The endpoint (base url) of the target resource (with custom domain name when it is used).",
150
-
"minLength": 1,
151
-
"type": "string"
152
-
},
153
-
"sourceResourceId": {
154
-
"description": "The Azure Resource ID of the source speech resource.",
155
-
"minLength": 1,
156
-
"type": "string"
157
-
},
158
-
"expirationDateTime": {
159
-
"format": "date-time",
160
-
"description": "The expiration date of this copy authorization.",
161
-
"type": "string"
162
-
},
163
-
"id": {
164
-
"description": "The ID of this copy authorization.",
165
-
"minLength": 1,
166
-
"type": "string"
167
-
}
168
-
}
169
-
},
170
-
```
171
-
172
-
```json
173
-
"ModelCopyAuthorizationDefinition": {
174
-
"title": "ModelCopyAuthorizationDefinition",
175
-
"required": [
176
-
"sourceResourceId"
177
-
],
178
-
"type": "object",
179
-
"properties": {
180
-
"sourceResourceId": {
181
-
"description": "The Azure Resource ID of the source speech resource.",
182
-
"minLength": 1,
183
-
"type": "string"
184
-
}
185
-
}
186
-
},
187
-
```
188
-
189
-
### CustomModelLinks copy properties
190
-
191
-
New `copy` property
192
-
copyTo URI: The location to the obsolete model copy action. See operation \"Models_CopyTo\" for more details.
193
-
copy URI: The location to the model copy action. See operation \"Models_Copy\" for more details.
194
-
195
-
```json
196
-
"CustomModelLinks": {
197
-
"title": "CustomModelLinks",
198
-
"type": "object",
199
-
"properties": {
200
-
"copyTo": {
201
-
"format": "uri",
202
-
"description": "The location to the obsolete model copy action. See operation \"Models_CopyTo\" for more details.",
203
-
"type": "string",
204
-
"readOnly": true
205
-
},
206
-
"copy": {
207
-
"format": "uri",
208
-
"description": "The location to the model copy action. See operation \"Models_Copy\" for more details.",
209
-
"type": "string",
210
-
"readOnly": true
211
-
},
212
-
"files": {
213
-
"format": "uri",
214
-
"description": "The location to get all files of this entity. See operation \"Models_ListFiles\" for more details.",
215
-
"type": "string",
216
-
"readOnly": true
217
-
},
218
-
"manifest": {
219
-
"format": "uri",
220
-
"description": "The location to get a manifest for this model to be used in the on-prem container. See operation \"Models_GetCustomModelManifest\" for more details.",
221
-
"type": "string",
222
-
"readOnly": true
223
-
}
224
-
},
225
-
"readOnly": true
226
-
},
227
-
```
228
-
229
113
## Operation IDs
230
114
231
115
You must update the base path in your code from `/speechtotext/v3.1` to `/speechtotext/v3.2-preview.1`. For example, to get base models in the `eastus` region, use `https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2-preview.1/models/base` instead of `https://eastus.api.cognitive.microsoft.com/speechtotext/v3.1/models/base`.
Copy file name to clipboardExpand all lines: articles/ai-services/speech-service/releasenotes.md
+1-2Lines changed: 1 addition & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -21,11 +21,10 @@ Azure AI Speech is updated on an ongoing basis. To stay up-to-date with recent d
21
21
22
22
## Recent highlights
23
23
24
+
* Azure AI Speech now supports OpenAI's Whisper model via the batch transcription API. To learn more, check out the [Create a batch transcription](./batch-transcription-create.md#using-whisper-models) guide.
24
25
*[Speech to text REST API version 3.2](./migrate-v3-1-to-v3-2.md) is available in public preview.
25
26
* Speech SDK 1.32.1 was released in September 2023.
26
27
*[Real-time diarization](./get-started-stt-diarization.md) is in public preview.
27
-
* Some Speech Studio [scenarios](speech-studio-overview.md#speech-studio-scenarios) are available to try without an Azure subscription.
28
-
* Text to speech [Batch synthesis API](./batch-synthesis.md) is available in public preview.
0 commit comments