Setting prefix results in empty transcription #1085
-
Hi! When providing the prefix decoding option, the model spits out an empty transcription. When I supply a whitespace, transcription quality decreases but outputs most often exist. When I supply a full sentence with "." at the end, the models output is empty. Same for sentences or words without "." How can I fix that? Thank you |
Beta Was this translation helpful? Give feedback.
Replies: 3 comments 2 replies
-
@jongwook Must the prefix contain the currently processed audio sample? Or can the prefix be arbitrary? |
Beta Was this translation helpful? Give feedback.
-
With a "." in the prefix, the model assumes the segment starts with a period mark, and my guess is that the model hasn't seen many transcripts in the training data that starts a line with a I think you might be looking for |
Beta Was this translation helpful? Give feedback.
-
#1358 If I understood your issue correctly, there was a small bug when using default |
Beta Was this translation helpful? Give feedback.
With a "." in the prefix, the model assumes the segment starts with a period mark, and my guess is that the model hasn't seen many transcripts in the training data that starts a line with a
"."
. Even when it does, it's more likely a line containing"."
only, hence an empty prediction. The same logic goes for the whitespace prefix, although the distribution may be slightly different.I think you might be looking for
prompt
instead ofprefix
(cf. #117 (comment))