You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/ai-services/speech-service/whisper-overview.md
+12-10Lines changed: 12 additions & 10 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -13,7 +13,9 @@ ms.author: eur
13
13
14
14
# What is the Whisper model?
15
15
16
-
The Whisper model is a speech to text model from OpenAI that you can use to transcribe audio files. The model is trained on a large dataset of English audio and text. The model is optimized for transcribing audio files that contain speech in English. The model can also be used to transcribe audio files that contain speech in other languages. The output of the model is English text.
16
+
The Whisper model is a speech to text model from OpenAI that you can use to transcribe or translate audio files. The model is trained on a large dataset of English audio and text.
17
+
- The model is optimized for transcribing audio files that contain speech in English.
18
+
- The model can also be used to translate audio files that contain speech in other languages. The output of the transcription is English text.
17
19
18
20
Whisper models are available via the Azure OpenAI Service or via Azure AI Speech. The features differ for those offerings. In [Azure AI Speech (batch transcription)](./batch-transcription-create.md#use-a-whisper-model), Whisper is just one of several models that you can use for speech to text.
19
21
@@ -38,20 +40,20 @@ Either the Whisper model or the Azure AI Speech models are appropriate depending
38
40
| Contact center voice agent: Call routing and interactive voice response for call centers. | Available | Recommended |
39
41
| Voice assistant: Application specific voice assistant for a set-top box, mobile app, in-car, and other scenarios. | Available | Recommended |
40
42
| Pronunciation assessment: Assess the pronunciation of a speaker's voice. | Not available | Recommended |
41
-
| Translate live audio from one language to another. | Not available | Recommended via the [speech translation API](./speech-translation.md)|
42
-
| Translate prerecorded audio from other languages into English. | Recommended |Available via the [speech translation API](./speech-translation.md)|
43
-
| Translate prerecorded audio into languages other than English. | Not available | Recommended via the [speech translation API](./speech-translation.md)|
43
+
| Translate live audio from one language to another. | Not available | Recommended via the [speech translation API](./speech-translation.md).|
44
+
| Translate prerecorded audio from other languages into English. | Recommended |Also available via the [speech translation API](./speech-translation.md).|
45
+
| Translate prerecorded audio into languages other than English. | Not available | Recommended via the [speech translation API](./speech-translation.md).|
44
46
45
47
## Whisper model via Azure AI Speech or via Azure OpenAI Service?
46
48
47
-
If you decide to use the Whisper model, you have two options. You can choose whether to use the Whisper Model via [Azure OpenAI](../openai/whisper-quickstart.md) or via [Azure AI Speech (batch transcription)](./batch-transcription-create.md#use-a-whisper-model). In either case, the readability of the transcribed text is the same. You can input mixed language audio and the output is in English.
49
+
If you decide to use the Whisper model, you have two options. You can choose whether to use the Whisper Model via [Azure OpenAI Service](../openai/whisper-quickstart.md) or via [Azure AI Speech (batch transcription)](./batch-transcription-create.md#use-a-whisper-model). In either case, the readability of the transcribed text is the same.
48
50
49
51
Whisper Model via Azure OpenAI Service might be best for:
50
-
- Quickly transcribing audio files one at a time
51
-
- Translate audio from other languages into English
52
-
- Provide a prompt to the model to guide the output
0 commit comments