Skip to content

Commit 7000406

Browse files
Merge pull request #270240 from eric-urban/eur/stt-whisper-updates
stt whisper updates
2 parents 55649d5 + 2469e96 commit 7000406

File tree

2 files changed

+13
-10
lines changed

2 files changed

+13
-10
lines changed

articles/ai-services/speech-service/migrate-v3-1-to-v3-2.md

Lines changed: 7 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,7 @@ author: eric-urban
66
manager: nitinme
77
ms.service: azure-ai-speech
88
ms.topic: how-to
9-
ms.date: 1/21/2024
9+
ms.date: 3/26/2024
1010
ms.author: eur
1111
ms.devlang: csharp
1212
ms.custom: devx-track-csharp
@@ -177,9 +177,10 @@ Added token count and token error properties to the `EvaluationProperties` prope
177177

178178
### Model copy
179179

180-
Added the new `"/operations/models/copy/{id}"` operation. Used for copy models scenario.
181-
Added the new `"/models/{id}:copy"` operation. Schema in the new copy operation: `"$ref": "#/definitions/ModelCopyAuthorization"` Deprecated the `"/models/{id}:copyto"` operation. Schema in the deprecated copy operation: `"$ref": "#/definitions/ModelCopy"`
182-
Added the new `"/models:authorizecopy"` operation returns `"$ref": "#/definitions/ModelCopyAuthorization"`. This returned entity can be used in the new `"/models/{id}:copy"` operation.
180+
The following changes are for the scenario where you copy a model.
181+
- Added the new [Models_Copy](https://eastus.dev.cognitive.microsoft.com/docs/services/speech-to-text-api-v3-2-preview2/operations/Models_Copy) operation. Here's the schema in the new copy operation: `"$ref": "#/definitions/ModelCopyAuthorization"`
182+
- Deprecated the [Models_CopyTo](https://eastus.dev.cognitive.microsoft.com/docs/services/speech-to-text-api-v3-2-preview2/operations/Models_CopyTo) operation. Here's the schema in the deprecated copy operation: `"$ref": "#/definitions/ModelCopy"`
183+
- Added the new [Models_AuthorizeCopy](https://eastus.dev.cognitive.microsoft.com/docs/services/speech-to-text-api-v3-2-preview2/operations/Models_AuthorizeCopy) operation that returns `"$ref": "#/definitions/ModelCopyAuthorization"`. This returned entity can be used in the new [Models_Copy](https://eastus.dev.cognitive.microsoft.com/docs/services/speech-to-text-api-v3-2-preview2/operations/Models_Copy) operation.
183184

184185
Added a new entity definition for `ModelCopyAuthorization`:
185186

@@ -252,8 +253,8 @@ Added a new entity definition for `ModelCopyAuthorizationDefinition`:
252253
### CustomModelLinks copy properties
253254

254255
Added a new `copy` property.
255-
copyTo URI: The location to the obsolete model copy action. See operation \"Models_CopyTo\" for more details.
256-
copy URI: The location to the model copy action. See operation \"Models_Copy\" for more details.
256+
- `copyTo` URI: The location of the obsolete model copy action. See the [Models_CopyTo](https://eastus.dev.cognitive.microsoft.com/docs/services/speech-to-text-api-v3-2-preview2/operations/Models_CopyTo) operation for more details.
257+
- `copy` URI: The location of the model copy action. See the [Models_Copy](https://eastus.dev.cognitive.microsoft.com/docs/services/speech-to-text-api-v3-2-preview2/operations/Models_Copy) operation for more details.
257258

258259
```json
259260
"CustomModelLinks": {

articles/ai-services/speech-service/whisper-overview.md

Lines changed: 6 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -6,23 +6,25 @@ author: eric-urban
66
manager: nitinme
77
ms.service: azure-ai-speech
88
ms.topic: overview
9-
ms.date: 1/22/2024
9+
ms.date: 3/26/2024
1010
ms.author: eur
1111
---
1212

1313
# What is the Whisper model?
1414

1515
The Whisper model is a speech to text model from OpenAI that you can use to transcribe audio files. The model is trained on a large dataset of English audio and text. The model is optimized for transcribing audio files that contain speech in English. The model can also be used to transcribe audio files that contain speech in other languages. The output of the model is English text.
1616

17+
Whisper models are available via the Azure OpenAI Service or via Azure AI Speech. The features differ for those offerings. In Azure AI Speech, Whisper is just one of several speech to text models that you can use.
18+
1719
You might ask:
1820

1921
- Is the Whisper Model a good choice for my scenario, or is an Azure AI Speech model better? What are the API comparisons between the two types of models?
2022

2123
- If I want to use the Whisper Model, should I use it via the Azure OpenAI Service or via Azure AI Speech? What are the scenarios that guide me to use one or the other?
2224

23-
## Whisper model via Azure AI Speech models
25+
## Whisper model or Azure AI Speech models
2426

25-
Either the Whisper model or the Azure AI Speech models are appropriate depending on your scenarios. The following table compares options with recommendations about where to start.
27+
Either the Whisper model or the Azure AI Speech models are appropriate depending on your scenarios. If you decide to use Azure AI Speech, you can choose from several models, including the Whisper model. The following table compares options with recommendations about where to start.
2628

2729
| Scenario | Whisper model | Azure AI Speech models |
2830
|---------|---------------|------------------------|
@@ -41,7 +43,7 @@ Either the Whisper model or the Azure AI Speech models are appropriate depending
4143

4244
## Whisper model via Azure AI Speech or via Azure OpenAI Service?
4345

44-
You can choose whether to use the Whisper Model via [Azure OpenAI](../openai/whisper-quickstart.md) or via [Azure AI Speech](./batch-transcription-create.md#use-a-whisper-model). In either case, the readability of the transcribed text is the same. You can input mixed language audio and the output is in English.
46+
If you decide to use the Whisper model, you have two options. You can choose whether to use the Whisper Model via [Azure OpenAI](../openai/whisper-quickstart.md) or via [Azure AI Speech](./batch-transcription-create.md#use-a-whisper-model). In either case, the readability of the transcribed text is the same. You can input mixed language audio and the output is in English.
4547

4648
Whisper Model via Azure OpenAI Service might be best for:
4749
- Quickly transcribing audio files one at a time

0 commit comments

Comments
 (0)