Replies: 1 comment
-
Hi @vlashada , I came across this old post. I'm keen to hear if you have found better strategies / settings than above parameter values. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
I have the mp4 files for Peppa Pig with Chinese dub. I am trying to make a transcript using OpenAI Whisper, but I am struggling to get high-quality output. I experience frequent hallucinations or repeats of words.
For preprocessing, I am removing the into and outro of the video. For postprocessing, I am removing lines if they are really long, have very many repeated words or have many non-Hanzi characters. These are my current options for the transcript:
I have tried looking for a comprehensive guide into what all the parameters are, and what the best strategy is for getting the best performance, but I am struggling to find any good resources. I have a Quadro RTX 6000, so I am easily able to run large-v3.
Beta Was this translation helpful? Give feedback.
All reactions