voxagent

Voice-powered terminal agent. Fully offline.

Press a key, speak, get an answer. Nothing leaves your machine.

Meta
Powered by

Quick start

npm install -g voxagent
voxagent

Requirements

Node.js 18+
Ollama running locally

That's it. No API keys. No cloud accounts. No recurring costs.

On first run, voxagent downloads a small whisper model (~150 MB) for speech-to-text. Everything runs on your machine.

Usage

$ voxagent

Press ENTER to speak...
[Recording...] Press ENTER to stop.

Transcribing...
You: What's the default port for PostgreSQL?

Thinking...
PostgreSQL runs on port 5432 by default.

Press ENTER to speak...

Options

--model <name>   Ollama model to use (default: llama3.2)
--help, -h       Show help
--version, -v    Show version

How it works

voxagent captures your voice with decibri, transcribes it locally with whisper.cpp, sends the text to your local Ollama model, and prints the response.

No audio is recorded, stored, or transmitted. Ever.

Powered by

decibri - cross-platform microphone capture
whisper.cpp - local speech-to-text
Ollama - local LLM inference

License

Apache 2.0

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.claude		.claude
.github/workflows		.github/workflows
bin		bin
docs		docs
lib		lib
.gitignore		.gitignore
ATTRIBUTION.md		ATTRIBUTION.md
LICENSE		LICENSE
README.md		README.md
package-lock.json		package-lock.json
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

voxagent

Quick start

Requirements

Usage

Options

How it works

Powered by

License

About

Uh oh!

Releases 1

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

voxagent

Quick start

Requirements

Usage

Options

How it works

Powered by

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages