Live dictation combined with KaldiAG voice coding for computer control #285
shervinemami
started this conversation in
Show and tell
Replies: 1 comment
-
How did you get Whisper_mic to work? That piece just gives me |
-
I got OpenAI Whisper working as a natural-language dictation backend for the "Kaldi Active Grammar" speech recognition / voice coding framework. This lets me control my computer in real time using commands recognized by Kaldi, while writing plain text through Whisper dictation. As a result, I can get by without a keyboard for web browsing, document writing, and coding.
My code is just hacked together for now, but it's a usable start. I'm sure the Kaldi Active Grammar folks will integrate it better soon.
Demo video: https://youtu.be/7OCMu6JRL-8
(Instructions for getting this working are on the KaldiAG Gitter chat at "https://gitter.im/kaldi-active-grammar/community?at=6342b59d773e7e5c7bc87cb2")
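To picture the split between commands and dictation, here is a rough sketch of a router that tries a command decoder first and falls back to a dictation decoder. It is illustrative only and is not the code from the demo: `route_utterance`, `fake_command_decoder`, and `fake_dictation_decoder` are hypothetical names, the decoders are toy stand-ins that do not call Kaldi Active Grammar or Whisper, and the "try commands first, else dictate" rule is an assumption about how the two backends could be combined.

```python
from dataclasses import dataclass
from typing import Callable, Optional

# Hypothetical decoder signatures. In a real setup the command decoder would
# wrap a Kaldi Active Grammar engine and the dictation decoder would wrap Whisper.
CommandDecoder = Callable[[bytes], Optional[str]]  # command name, or None if no rule matched
DictationDecoder = Callable[[bytes], str]          # free-form transcript


@dataclass
class Result:
    kind: str  # "command" or "dictation"
    text: str


def route_utterance(
    audio: bytes,
    decode_command: CommandDecoder,
    decode_dictation: DictationDecoder,
) -> Result:
    """Assumed policy: prefer a command match, otherwise treat the audio as dictation."""
    command = decode_command(audio)
    if command is not None:
        return Result("command", command)
    return Result("dictation", decode_dictation(audio).strip())


# Toy stand-ins so the sketch runs without any speech libraries.
# The "audio" here is just UTF-8 text pretending to be a recording.
def fake_command_decoder(audio: bytes) -> Optional[str]:
    commands = {b"open browser": "open_browser", b"scroll down": "scroll_down"}
    return commands.get(audio)


def fake_dictation_decoder(audio: bytes) -> str:
    return audio.decode("utf-8")


if __name__ == "__main__":
    print(route_utterance(b"scroll down", fake_command_decoder, fake_dictation_decoder))
    print(route_utterance(b" hello world ", fake_command_decoder, fake_dictation_decoder))
```

Keeping the two decoders behind plain callables means either backend can be swapped out without touching the routing logic.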