You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Give your AI agents the ability to listen. Microphone capture and speech-to-text tools for MCP-compatible agents. Powered by [decibri](https://decibri.dev).
6
9
7
10
## Tools
8
11
9
12
| Tool | Description |
10
-
|------|-------------|
13
+
|------|-------------|
11
14
|`list_audio_devices`| List available microphone input devices |
12
15
|`capture_audio`| Record audio from the microphone and save as WAV |
13
16
|`voice_query`| Capture, transcribe (whisper.cpp), and query a local LLM (Ollama) |
@@ -17,7 +20,7 @@ Microphone capture and speech-to-text tools for MCP-compatible agents. Powered b
17
20
### Claude Code
18
21
19
22
```bash
20
-
claude mcp add mcp-listen -- npx mcp-listen
23
+
claude mcp add mcp-listen npx mcp-listen
21
24
```
22
25
23
26
### Claude Desktop / ChatGPT Desktop / Cursor / Windsurf / VS Code
@@ -35,7 +38,7 @@ Add to your MCP configuration:
35
38
}
36
39
```
37
40
38
-
Works with any MCP-compatible client: Claude, ChatGPT, Cursor, GitHub Copilot, Windsurf, VS Code, Gemini, Zed, and more.
41
+
Compatible with Claude Desktop, ChatGPT Desktop, Cursor, GitHub Copilot, Windsurf, VS Code, Gemini, Zed, and any MCP-compatible client.
39
42
40
43
### Global Install
41
44
@@ -79,7 +82,7 @@ Records audio from the microphone and saves as a WAV file.
79
82
**Parameters:**
80
83
81
84
| Parameter | Type | Default | Description |
82
-
|-----------|------|---------|-------------|
85
+
|----------|------|---------|-------------|
83
86
|`duration_ms`| number | 5000 | Recording duration in milliseconds (100-30000) |
84
87
|`device`| number | system default | Device index from `list_audio_devices`|
85
88
@@ -102,7 +105,7 @@ Full voice pipeline: capture audio, transcribe with whisper.cpp, send to Ollama,
102
105
**Parameters:**
103
106
104
107
| Parameter | Type | Default | Description |
105
-
|-----------|------|---------|-------------|
108
+
|-----------|------|---------|-------------|
106
109
|`duration_ms`| number | 5000 | Recording duration in milliseconds (100-30000) |
107
110
|`device`| number | system default | Device index from `list_audio_devices`|
108
111
|`whisper_model`| string | ggml-base.en.bin | Path or filename of Whisper GGML model |
@@ -132,11 +135,20 @@ The `voice_query` tool replicates the pipeline from [voxagent](https://voxagent.
132
135
133
136
The `voice_query` tool requires a Whisper GGML model file. Download one:
0 commit comments