Replies: 1 comment 6 replies
-
whisper doesn't perform well on languages with low resources (the paper said there're only 14h of tatar audio, no error rate given) to make it ready-to-use you have to fine tune whisper |
Beta Was this translation helpful? Give feedback.
6 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hello!
I use whisper (openai-whisper==20230314) for transcribing tatar language audio [https://www.youtube.com/watch?v=w_m_q34Q-qA](from youtube video) - first changed to mp3 with
ffmpeg videoplayback.mp4 tatar.mp3
, then used commandalso tried
result is the same - have this transcription
but there are no
i
, 'ұ', 'қ' letters in tatar alphabet.Tatar alphabet:
https://en.wikipedia.org/wiki/Tatar_alphabet
(look accepted spelling, it is now used in Tatarstan (the motherland for tatars)
Beta Was this translation helpful? Give feedback.
All reactions