Problems with Transcribing Multilingual Audio #1367
Unanswered
RohitMidha23 asked this question in Q&A
Replies: 2 comments
-
Unfortunately, Whisper doesn't support multilingual audio, i.e. audio that mixes several languages in one recording.
-
Pass this argument to the model when generating the output: forced_decoder_ids=processor.get_decoder_prompt_ids(language="gu", task="transcribe")
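For context, here is a rough sketch of where that argument would go when using the Hugging Face Transformers port of Whisper. The checkpoint name, the 16 kHz mono input, and the helper names build_generate_kwargs and transcribe are assumptions made for illustration; they are not from the original post.

```python
# Sketch: force Whisper (Hugging Face Transformers) to transcribe in Gujarati.
# Assumptions (not from the thread): checkpoint name, 16 kHz mono waveform,
# and both helper names are hypothetical.


def build_generate_kwargs(processor, language="gu", task="transcribe"):
    """Return the extra keyword arguments to pass to model.generate()."""
    # get_decoder_prompt_ids yields the (position, token_id) pairs that pin
    # the language and task tokens at the start of decoding.
    forced_ids = processor.get_decoder_prompt_ids(language=language, task=task)
    return {"forced_decoder_ids": forced_ids}


def transcribe(audio, sampling_rate=16000, checkpoint="openai/whisper-small"):
    """Transcribe a 1-D float waveform (up to 30 s) with a forced language."""
    # Imported lazily so the helper above works without the heavy dependencies.
    from transformers import WhisperForConditionalGeneration, WhisperProcessor

    processor = WhisperProcessor.from_pretrained(checkpoint)
    model = WhisperForConditionalGeneration.from_pretrained(checkpoint)

    # Convert the raw waveform into log-mel input features.
    inputs = processor(audio, sampling_rate=sampling_rate, return_tensors="pt")
    predicted_ids = model.generate(
        inputs.input_features, **build_generate_kwargs(processor)
    )
    return processor.batch_decode(predicted_ids, skip_special_tokens=True)[0]
```

Depending on the installed Transformers version, passing forced_decoder_ids to generate may emit a deprecation warning; newer releases also accept language and task directly as arguments to generate. Note that this only fixes the output language and task, so it does not by itself make Whisper handle speech that switches between Gujarati and English.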
-
I have a video in which the speaker talks in Gujarati mixed with English. I don't mind whether the transcript comes out entirely in English or entirely in Gujarati.
I'm using the following snippet to transcribe the audio:
Some sample output:
I've included only the relevant output, but I'm happy to share the full output if required.
Issues
I noticed major issues with this output that I need help with.
Chit shuddhi thaye j
I have tried forcing the language to English, but even then the output is translated rather than transcribed.
Any help would be greatly appreciated.