-
Whisper's accuracy on Uzbek is very poor (WER > 90%; see the paper), so you may want to fine-tune it. Using a custom tokenizer would mean retraining Whisper from scratch.
-
I use the whisper-small tokenizer for the Uzbek language.
It works, but something is wrong: it returns non-Uzbek characters.
How good (correct) is it?
How can I fix that?
Can I make a custom tokenizer? (Please share some articles.)
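
To put a number on the "non-Uzbek characters" problem before deciding whether to fine-tune, one option is to scan the transcripts for characters outside the Uzbek alphabets. The snippet below is only a rough sketch, not part of Whisper or any library: the function name `non_uzbek_chars` and the allowed character sets (Uzbek Latin, Uzbek Cyrillic, and the apostrophe-like marks used in oʻ and gʻ) are my own assumptions, so adjust them to the script your data uses.

```python
import unicodedata

# Assumption: Uzbek Latin letters ('c' is kept because of "ch"; 'w' is left out).
UZBEK_LATIN = set("abcdefghijklmnopqrstuvxyz")
# Assumption: apostrophe-like marks seen in oʻ, gʻ and the tutuq belgisi.
APOSTROPHES = set("ʻʼ'’‘`")
# Assumption: the 35-letter Uzbek Cyrillic alphabet.
UZBEK_CYRILLIC = set("абвгдеёжзийклмнопрстуфхцчшъьэюяўқғҳ")

ALLOWED = UZBEK_LATIN | APOSTROPHES | UZBEK_CYRILLIC


def non_uzbek_chars(text):
    """Return the sorted distinct characters that fall outside ALLOWED.

    Whitespace, digits, and punctuation are ignored.
    """
    found = set()
    for ch in text.lower():
        if ch in ALLOWED or ch.isspace() or ch.isdigit():
            continue
        if unicodedata.category(ch).startswith("P"):
            continue  # skip punctuation such as commas and periods
        found.add(ch)
    return sorted(found)


if __name__ == "__main__":
    # Hypothetical transcripts, for illustration only.
    for line in ["Oʻzbekiston gʻalaba", "salom 你好"]:
        print(repr(line), non_uzbek_chars(line))
```

Running this over a batch of model outputs shows which unexpected characters appear and how often, which helps when comparing the model before and after fine-tuning.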