You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I am trying to transcribe a large collection of lecture videos from difference speakers, in English and German. In a few videos, that are from the same speaker (thus same setup, format, etc), but not all the videos of that speaker, whisper only transcribes a few minutes at the beginning of the video. For example, the videos are almost an hour long and the transcription is done for 6-7 minutes at the beginning.
I run whisper multiple times on the same lectures, but the outcome is the same.
My current pipeline is export the audio from the videos using FFmpeg. Then use the *.wav file (also used MP3s) with standard whisper (large) to transcribe the audio.
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Hello,
I am trying to transcribe a large collection of lecture videos from difference speakers, in English and German. In a few videos, that are from the same speaker (thus same setup, format, etc), but not all the videos of that speaker, whisper only transcribes a few minutes at the beginning of the video. For example, the videos are almost an hour long and the transcription is done for 6-7 minutes at the beginning.
I run whisper multiple times on the same lectures, but the outcome is the same.
My current pipeline is export the audio from the videos using FFmpeg. Then use the *.wav file (also used MP3s) with standard whisper (large) to transcribe the audio.
Any idea would be greatly appreciated.
Thank you
Beta Was this translation helpful? Give feedback.
All reactions