Whisper struggling with transcribing some YouTube videos #1494
-
I have used Whisper for transcription few weeks before and it worked well even for long videos. But now when I am trying to transcribe this video https://www.youtube.com/watch?v=KqbXZJ80yFU it results in hallucinations. The transcription makes no sense. I have tried to trim the file, set the required sampling rate (16000) and converted the file to mono wav as well but no luck so far. I am not sure what is wrong with this file. I even removed the first few seconds where there is no speech or sound. I tried this with other videos from YouTube as well and its struggling with the transcription. I have tried base, small, medium and large models as well. The input video is clear and audible but still I am not sure what is wrong with the file. Here is the transcription generated by Large Whisper model (file trimmed to first 3 minutes)
Any help in this regard is highly appreciated. |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 2 replies
-
whisper misdetect the language u must specify |
Beta Was this translation helpful? Give feedback.
as i said: whisper misdetect the language
for better transcriptions, u should always specify language