What's the difference? (language="en" vs "medium.en" model) #2261

SimpleVictor · 2024-07-12T16:24:43Z

SimpleVictor
Jul 12, 2024

Lets assume I only care about english. Can someone help me understand the difference between the 3 approaches below?

1. Using medium model with language="en" attached

whisper english-only.wav --model medium --language english

2. Using medium.en model with NO language="en" attached

whisper english-only.wav --model medium.en

3. Using medium.en model with language="en" attached

whisper english-only.wav --model medium.en --language english

Answered by glangford

Jul 12, 2024

Without language= whisper will perform language detection using the first 30s of audio. If you only care about English, then specify language=en to skip language detection.

medium is a multilingual model, whereas medium.en is an English only model.

From the README at https://github.com/openai/whisper#available-models-and-languages

The .en models for English-only applications tend to perform better, especially for the tiny.en and base.en models. We observed that the difference becomes less significant for the small.en and medium.en models.

glangford
Jul 12, 2024

Without language= whisper will perform language detection using the first 30s of audio. If you only care about English, then specify language=en to skip language detection.

medium is a multilingual model, whereas medium.en is an English only model.

From the README at https://github.com/openai/whisper#available-models-and-languages

The .en models for English-only applications tend to perform better, especially for the tiny.en and base.en models. We observed that the difference becomes less significant for the small.en and medium.en models.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

What's the difference? (language="en" vs "medium.en" model) #2261

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Replies: 1 comment

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Select a reply

Uh oh!

What's the difference? (language="en" vs "medium.en" model) #2261

Uh oh!

Uh oh!

SimpleVictor Jul 12, 2024

Replies: 1 comment

Uh oh!

Uh oh!

glangford Jul 12, 2024

SimpleVictor
Jul 12, 2024

glangford
Jul 12, 2024