Constrain recognition to a BNF-style grammar for command and control? #2100

ulatekh · 2024-03-21T21:32:55Z

Is it possible to constrain what Whisper recognizes to a BNF-style grammar, for command-and-control purposes?

If it's not presently implemented, can anyone speculate on where this could be integrated into the current code base?

Another way to look at it: Whisper would expose what it has recognized in the audio sample so far (starting with nothing) and accept a list of words it is allowed to recognize next, constraining its transcription to just those words.
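To make the "allowed next words" idea concrete, here is a minimal sketch in plain Python. It is not Whisper code: the command phrases, the `END` marker, and the function names are all made up for illustration, and it works on whole words, whereas Whisper decodes sub-word tokens, so a real integration would need the same logic at the token level (possibly as a logit filter in `whisper/decoding.py`, though that is only a guess).

```python
from typing import Dict, Iterable, List, Optional, Set

END = "<end>"  # hypothetical marker: the command is allowed to stop here


def build_trie(phrases: Iterable[str]) -> Dict:
    """Build a word-level prefix tree from the allowed command phrases."""
    root: Dict = {}
    for phrase in phrases:
        node = root
        for word in phrase.lower().split():
            node = node.setdefault(word, {})
        node[END] = {}
    return root


def allowed_next(trie: Dict, recognized: List[str]) -> Set[str]:
    """Return the words the grammar allows after the words recognized so far."""
    node = trie
    for word in recognized:
        if word not in node:
            return set()  # the prefix has already left the grammar
        node = node[word]
    return set(node)


def pick_next(trie: Dict, recognized: List[str],
              scores: Dict[str, float]) -> Optional[str]:
    """Pick the best-scoring candidate word that the grammar allows next.

    `scores` stands in for whatever per-word scores a recognizer would give.
    """
    allowed = allowed_next(trie, recognized)
    candidates = {w: s for w, s in scores.items() if w in allowed}
    if not candidates:
        return None
    return max(candidates, key=candidates.get)


trie = build_trie(["turn on the light", "turn off the light", "open the door"])
# After "turn", only "on" and "off" are allowed, so a higher-scoring
# but out-of-grammar "of" is ignored.
choice = pick_next(trie, ["turn"], {"of": 0.9, "off": 0.6, "on": 0.3})
```

A flat list of phrases is the simplest possible grammar; a real BNF-style grammar would replace the trie with a parser state, but the "filter candidates by what may come next" step would stay the same.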

phineas-pta · 2024-03-22T00:53:34Z

Transcript post-processing seems to be the only easy way.

The approach you suggest would require training a model on top of Whisper to do something like forcing word output.
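As a rough illustration of the post-processing route, the sketch below snaps a free-form transcript to the closest entry in a fixed command list using the standard library's `difflib`. The command list, the helper names, and the 0.6 cutoff are assumptions for the example, not anything Whisper provides; the transcript string would come from whatever the normal transcription call returns.

```python
import difflib
import re
from typing import List, Optional

# Hypothetical command set; replace with the phrases your application accepts.
COMMANDS = ["turn on the light", "turn off the light", "open the door"]


def normalize(text: str) -> str:
    """Lowercase, drop punctuation, and collapse whitespace."""
    text = re.sub(r"[^a-z0-9\s]", "", text.lower())
    return " ".join(text.split())


def snap_to_command(transcript: str,
                    commands: List[str] = COMMANDS,
                    cutoff: float = 0.6) -> Optional[str]:
    """Return the closest known command, or None if nothing is similar enough."""
    matches = difflib.get_close_matches(
        normalize(transcript), commands, n=1, cutoff=cutoff
    )
    return matches[0] if matches else None


# A slightly-off transcript still maps to a valid command.
result = snap_to_command(" Turn on the lights.")
```

This only corrects the text after the fact; it does not stop the recognizer from hearing something outside the command set, so an unmatched transcript should be treated as "no command" rather than forced onto the nearest phrase.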

ulatekh · 2024-03-27T21:59:21Z

whisper.cpp has a grammar feature that works pretty well for me; I don't know if its technique can be backported to this version.
