V1 Large still working best for me compared to V2 or V3 for English #1836
Replies: 4 comments 19 replies
-
Can you share the audio or give a pointer to the Youtube content |
Beta Was this translation helpful? Give feedback.
-
I haven't tried large models yet, but these settings give good results:
[00:00.000 --> 00:08.000] In this tutorial video, I will guide you through setting up your Stable Diffusion XL SDXL Koia Training Notebook on a free Kaggle account. |
Beta Was this translation helpful? Give feedback.
-
--best_of 10 --beam_size 10 making significant difference in terms of hallucination and quality what is patience doing? |
Beta Was this translation helpful? Give feedback.
-
granted I'm using faster-whisper/whisper-ctranslate2, but i'm finding the options discussed here ( So I'm back to my old parameters:
I should probably try this with the latest update to whisper itself to see if |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
This is so weird. I use below settings and V2 and V3 hallucinates a lot but V1 performs best
--model large-v1 --language en --initial_prompt "Welcome our Youtube channel." --best_of 10 --beam_size 10
moreover I still have punctuation completely loss issue
Beta Was this translation helpful? Give feedback.
All reactions