-
I'm not sure which website you visited. There are 4 English-only models and 5 multilingual models. For multilingual models, if you don't tell ...
-
Yes, the only difference would be the initial "detect the language in the first 30 seconds" stage. Like ryanheise said:
Before you submit the audio... If you know which language the audio is:
If you don't know what language the audio is:
Here's a different way of explaining it: Let's say you have a Spanish audio file you want to transcribe. You could:
Method A. Specify nothing:
Method B. Specify language:
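As a rough sketch of the two methods, here is how the two command lines could be put together in Python. The `build_whisper_command` helper and the `audio.mp3` filename are made up for illustration; the only parts taken from Whisper itself are the `whisper` command and its `--language` flag. The snippet only builds and prints the commands, it does not run Whisper.

```python
import shlex


def build_whisper_command(audio_path, language=None):
    """Build the argument list for the `whisper` command-line tool.

    Hypothetical helper for illustration only. If `language` is None,
    no --language flag is added, so Whisper has to guess the language.
    """
    cmd = ["whisper", audio_path]
    if language is not None:
        cmd += ["--language", language]
    return cmd


# Method A: specify nothing, Whisper auto-detects the language.
method_a = build_whisper_command("audio.mp3")

# Method B: tell Whisper the language up front.
method_b = build_whisper_command("audio.mp3", language="Spanish")

print(shlex.join(method_a))
print(shlex.join(method_b))
```

To actually run one of them, the list could be passed to `subprocess.run(method_b, check=True)`, assuming Whisper is installed and the file exists.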
Method A will go a little slower, because Whisper is taking the MP3's first 30 seconds and trying to guess what language the audio is. Method A can then:
Method B will already be told it's Spanish, so Whisper can just begin transcribing right away.
Side Note: On accuracy of auto-guessing an audio's language... there can be a lot of wrenches thrown into the mix. For example, you may have:
Or, if you look at Whisper's graph of WER broken down by language, the further down the list your language is, the more likely Method A's auto-guessing is to get it wrong. For example:
The better Whisper's accuracy becomes in a given language, the better job it'll do at detecting the exact language for you. But... it'll never be as good as a human who just says:
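To make the "guess vs. being told" point concrete, here is a minimal sketch of the decision in Python. The `choose_language` helper and the probability numbers are invented for illustration; they only mimic the idea of picking the most probable language from a set of per-language scores, and are not real Whisper output.

```python
def choose_language(detected_probs, user_language=None):
    """Return (language, source) for a transcription run.

    Hypothetical helper: an explicitly given language always wins;
    otherwise fall back to the most probable detected language.
    """
    if user_language is not None:
        return user_language, "user"
    best_guess = max(detected_probs, key=detected_probs.get)
    return best_guess, "detected"


# Made-up scores for a Spanish recording that "sounds" Portuguese
# in its first 30 seconds (illustrative numbers only).
wrong_guess = {"pt": 0.52, "es": 0.45, "it": 0.03}

# Method A: nothing specified, so the top guess is used (and is wrong here).
print(choose_language(wrong_guess))

# Method B: the human says it's Spanish, so the guess is never consulted.
print(choose_language(wrong_guess, user_language="es"))
```

In this made-up case Method A ends up with Portuguese, while Method B gets Spanish no matter what the detection step would have said.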
-
Hi!
I've been using Whisper and have come across the multilingual version. I understand that this allows Whisper to work with multiple languages simultaneously. However, I'm having trouble understanding why it's necessary to specify the language using the --language flag.
As far as I know, there is only one pre-trained mono-language model on the official website and it is in English. Therefore, I have a couple of questions:
Thank you in advance for any insights or clarification on this topic.