Skip to content
Discussion options

You must be logged in to vote

Without language= whisper will perform language detection using the first 30s of audio. If you only care about English, then specify language=en to skip language detection.

medium is a multilingual model, whereas medium.en is an English only model.

From the README at https://github.com/openai/whisper#available-models-and-languages

The .en models for English-only applications tend to perform better, especially for the tiny.en and base.en models. We observed that the difference becomes less significant for the small.en and medium.en models.

See also the discussion here:

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by SimpleVictor
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants