Replies: 2 comments
-
transcript post-processing seem to be the only easy way the way u suggest would require train a model on top of whisper to do something like force words output |
Beta Was this translation helpful? Give feedback.
0 replies
-
whisper.cpp has a grammar feature that works pretty well for me; I don't know if its technique can be backported to this version. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Is it possible to constrain what Whisper recognizes to a BNF-style grammar, for command-and-control purposes?
If it's not presently implemented, can anyone speculate on where this could be integrated into the current code base?
Another way to look at it would be for Whisper to make available what it's recognized in the audio-sample so far (starting with nothing), and accepting a list of words it's allowed to recognize next, constraining its transcription to just those words.
Beta Was this translation helpful? Give feedback.
All reactions