Replies: 6 comments 1 reply
-
Beta Was this translation helpful? Give feedback.
-
I set --condition_on_previous_text to False and it looks like fix the issue, but --initial_prompt doesn't seem to work anymore, the output content will change back and forth between Simplified Chinese and Traditional Chinese |
Beta Was this translation helpful? Give feedback.
-
Anyone konw how to fix it without set --condition_on_previous_text to False? Accuracy drops too much after disabling. |
Beta Was this translation helpful? Give feedback.
-
I have observed this before and it happens rather randomly. I only see it with large models, but I don't use the other ones as much. I am mostly doing Chinese speech to text with or without translation. |
Beta Was this translation helpful? Give feedback.
-
@miku1958 Hello, I encountered the same issue as you did. I made some modifications #1253 to the prompt and solved my issue. You can try the modified branch by running the following command: pip uninstall openai-whisper
pip install git+https://github.com/heimoshuiyu/whisper.git@prompt --upgrade |
Beta Was this translation helpful? Give feedback.
-
Hi @heimoshuiyu , this is quite interesting approach. If you don't mind, would you share your insight on why you decided not to introduce a VAD for your usecase? Would VAD affect the quality of transcription negatively? |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Hi, I tried to use "whisper --model large-v2 --output_format json --language Chinese" to process an audio of more than 4 hours, the first part works fine, but after 3 hours and 40 minutes it keeps repeating a certain word "小蜘蛛"(little spider in chinese) until the end. I have checked the audio, there is normal content in the back.
Does anyone know how to fix it? I am using Windows10, Cuda 11.7.
Thanks!
Beta Was this translation helpful? Give feedback.
All reactions