Skip to content
Discussion options

You must be logged in to vote

This happens when the model is unsure about the output (according to the compression_ratio_threshold and logprob_threshold settings). The most common failure mode is that it falls into a repeat loop, where it likely triggers the compression_ratio_threshold. The default setting tries temperatures 0, 0.2, 0.4, 0.6, 0.8, 1.0 until it gives up, at which it is less likely to be in a repeat loop but is also less likely to be correct.

You can try adding --temperature_increment_on_fallback None to prevent this behavior. In general, Whisper's performance on Chinese is not very good and would probably need fine-tuning or training from scratch to be usable.

Replies: 7 comments 11 replies

Comment options

You must be logged in to vote
4 replies
@jltchiu
Comment options

@jltchiu
Comment options

@jongwook
Comment options

@jltchiu
Comment options

Answer selected by jongwook
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
2 replies
@CangxiongChen
Comment options

@fleek
Comment options

Comment options

You must be logged in to vote
2 replies
@Jain-Archit
Comment options

@Atefeh197
Comment options

Comment options

You must be logged in to vote
3 replies
@Atefeh197
Comment options

@lixikun
Comment options

@vgcen
Comment options

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet