Suggestion: Add optional language parameter to Whisper (override automatic language detection) #2694
mark-reijerkerk started this conversation in Ideas
Replies: 1 comment 2 replies
Try the --language arg.
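For anyone landing here later, a minimal sketch of what that looks like, assuming the openai-whisper command-line tool is installed and on the PATH. The helper name `build_whisper_command` and the file name `interview.wav` are made up for illustration; only the `whisper` command and its `--language` flag come from the tool itself.

```python
from typing import List, Optional


def build_whisper_command(audio_path: str, language: Optional[str] = None) -> List[str]:
    """Build an argument list for the `whisper` command-line tool.

    Hypothetical helper: when `language` is None, no --language flag is
    added, so Whisper falls back to automatic language detection.
    """
    command = ["whisper", audio_path]
    if language is not None:
        # Force the transcription language instead of auto-detecting it.
        command += ["--language", language]
    return command


# Forced Dutch transcription of a hypothetical file:
print(" ".join(build_whisper_command("interview.wav", language="Dutch")))
# Auto-detection (no --language flag):
print(" ".join(build_whisper_command("interview.wav")))
```

The resulting list can be handed to `subprocess.run(...)` as-is, or the printed line can be pasted straight into a shell.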
The Whisper model card shows that the decoding pipeline already includes a LANGUAGE TAG stage before transcription. This means Whisper is already internally capable of transcribing in a specific language when the tag is known.
Right now, the model always performs automatic language identification, but this can be unreliable for multilingual speakers, mixed-accent audio, or short/noisy recordings.
Proposal:
Add an optional user-provided language parameter that overrides Whisper’s automatic language detection.
When this parameter is not provided, Whisper continues to detect the language automatically as it does today.
This would give developers more control, improve transcription accuracy, and avoid unintended translation or language switching, all without requiring any change to the underlying Whisper architecture.
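The behaviour described in the proposal can be sketched from Python roughly as follows. This is only an illustration under assumptions: `model` is taken to be an object with a Whisper-style `transcribe(audio, **options)` method (for example, the result of `whisper.load_model("base")` from the openai-whisper package), and the wrapper name `transcribe_with_language` is hypothetical.

```python
from typing import Any, Dict, Optional


def transcribe_with_language(
    model: Any, audio_path: str, language: Optional[str] = None
) -> Dict[str, Any]:
    """Transcribe `audio_path`, optionally overriding language detection.

    Hypothetical wrapper. Assumes `model.transcribe` accepts a `language`
    keyword, as openai-whisper's model objects do.
    """
    options: Dict[str, Any] = {}
    if language is not None:
        # Caller knows the language: skip automatic detection.
        options["language"] = language
    # With no `language` option, the model detects the language itself.
    return model.transcribe(audio_path, **options)


# Example usage (requires openai-whisper and a real audio file):
#   import whisper
#   model = whisper.load_model("base")
#   result = transcribe_with_language(model, "interview.wav", language="nl")
#   print(result["text"])
```

Keeping `language` optional means existing callers see no change, while callers who already know the spoken language can pin it.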