[proposal] Add Speech to Text in Kilo for the Chat #962
adamhill
started this conversation in
1. Feature requests
Replies: 1 comment
-
Get it done and rebase until it lands in vscode sounds like a great plan to me. You can even use the same tricks we use ourselves for the roo merge (mark changed areas, so the merges are easier to understand) |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Challenge
We currently have no way to talk to Kilo to interact with the LLM.
This may seem like a novelty to some people (hands off vibe coding) but its needed for developers that have no mobility in their hands, but otherwise have normal speech abilities.
We have a long weekend in the states so feels like a nice time to dig in and start
Proposed Solution (high level)
Basically, mimic the behavior of the Copilot VS Speech feature
Implementaton Details
.proposed.d.ts
namespace.proposed
API's.proposed
APIs & we have to use Insiders to test it.proposed
and into the released API or get a headless or not UI widget that could recieve text from the VS Speech ExtensionI have pinged (@bpasero) who seems to have implemented various VS Speech bitsand is its "shephard" according to commits, comments and external repos.
After Thursday, this might be much easier since they have open-sourced
copilot-chat
Alternative Implementations considered
Someone made a furtive collab over in the Roo Discord to try and do a web socket version that has not gone anywhere
I looked at sherpa-onnx it looks doable but waaaaaay more work
Additional Context & Mockups
I have a bunch of reasearch in Obsidian, but nothing thats easy to share. I will see what PDF export is like in Obsidian (ironically a Markdown first based system)
I am willing to see this through implementation and rebasing as needed as we wait for MS to release the API's to
stable
Thanks.
Beta Was this translation helpful? Give feedback.
All reactions