Releases: aperepel/claude-mlx-tts

v1.3.0

01 Jan 20:50

New Features

Streaming TTS (79% faster time-to-first-audio)
- Audio playback now starts almost immediately instead of waiting for full generation
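The idea can be sketched as follows. This is a minimal illustration, not the release's actual implementation: `generate_audio_chunks` and `play_streaming` are hypothetical stand-ins for a streaming TTS backend, with simulated synthesis delays.

```python
import time
from typing import Iterator, Optional

def generate_audio_chunks(text: str, n_chunks: int = 5) -> Iterator[bytes]:
    """Stand-in for a streaming TTS backend: yields audio as it is synthesized."""
    for _ in range(n_chunks):
        time.sleep(0.01)          # simulated per-chunk synthesis time
        yield b"\x00" * 1024      # placeholder PCM frame

def play_streaming(text: str) -> float:
    """Start playback on the first chunk instead of waiting for all of them.
    Returns the time-to-first-audio in seconds."""
    start = time.perf_counter()
    ttfa: Optional[float] = None
    for chunk in generate_audio_chunks(text):
        if ttfa is None:
            ttfa = time.perf_counter() - start  # first audio is ready here
        # feed chunk to the audio device (omitted)
    return ttfa
```

Because playback begins at the first yielded chunk, time-to-first-audio is one chunk's synthesis time rather than the whole utterance's.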

Dynamic Audio Compression
- Added professional-grade compressor/limiter for consistent volume levels
- Default "notification punch" preset for clear, punchy TTS output
- Prevents audio clipping and sudden volume spikes
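The release uses pedalboard for this; as a rough illustration of what a compressor/limiter chain does, here is a toy pure-Python version (simplified per-sample peak compression, not the actual preset or library code):

```python
def compress(samples, threshold=0.5, ratio=4.0, ceiling=0.98):
    """Toy peak compressor/limiter on float samples in [-1, 1]:
    magnitude above `threshold` is reduced by `ratio` (compressor),
    then hard-capped at `ceiling` (limiter) to prevent clipping."""
    out = []
    for s in samples:
        mag = abs(s)
        if mag > threshold:
            mag = threshold + (mag - threshold) / ratio  # compress the overshoot
        mag = min(mag, ceiling)                          # limiter: never exceed ceiling
        out.append(mag if s >= 0 else -mag)
    return out
```

Quiet samples pass through unchanged; loud peaks are pulled down toward the threshold, which is what keeps volume levels consistent across notifications.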

TTFT Metrics
- Time-to-first-token (TTFT) measurements are now logged for performance monitoring

Breaking Changes

Dependency change: pyloudnorm → pedalboard
- If upgrading, run `uv sync --extra mlx` to install the new dependency

v1.2.0

01 Jan 20:49

- Voice embeddings caching — Voice cloning now runs once at server startup.
  Subsequent requests load cached embeddings from disk, reducing per-request
  overhead by ~99% (1.5s → <10ms)
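The caching pattern can be sketched like this. The names (`CACHE`, `compute_embedding`, `get_embedding`) and the pickle format are illustrative assumptions, not the project's actual API:

```python
import pickle
import time
from pathlib import Path

CACHE = Path("voice_embedding.pkl")  # hypothetical on-disk cache location

def compute_embedding(ref_audio: str):
    """Stand-in for the expensive voice-cloning pass run once at startup."""
    time.sleep(0.05)        # simulated ~1.5 s cloning cost
    return [0.1, 0.2, 0.3]  # placeholder embedding vector

def get_embedding(ref_audio: str):
    """Compute the voice embedding once; later calls load it from disk."""
    if CACHE.exists():
        return pickle.loads(CACHE.read_bytes())  # fast path: cache hit (<10 ms)
    emb = compute_embedding(ref_audio)           # slow path: run voice cloning
    CACHE.write_bytes(pickle.dumps(emb))
    return emb
```

The first call pays the full cloning cost and writes the cache; every subsequent request is a disk read, which is where the ~99% per-request saving comes from.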

- Permission prompt notifications — Get an audio alert when Claude needs
  tool permission approval, so you don't miss prompts while away from the terminal

- Separate logging for generation vs playback time for clearer performance metrics

v1.1.1

01 Jan 20:49

Version 1.1.1: Fix permission hook to use venv Python for MLX TTS

v1.1.0

01 Jan 20:49

Version 1.1.0: TTS notification for tool permission prompts

v1.0.0

01 Jan 20:49

Add YouTube demo video to README

v0.1.0

01 Jan 20:49

Working implementation using subprocess calls for TTS:
- macOS 'say' command as default TTS backend
- MLX voice cloning via 'python -m mlx_audio.tts.generate' subprocess
- Claude CLI subprocess for summarization
- Threshold-based triggering (duration, tool calls, thinking keywords)
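The triggering logic above could look something like this. The function name, threshold values, and keyword list are illustrative, not the script's actual configuration:

```python
def should_notify(duration_s: float, tool_calls: int, transcript: str,
                  min_duration: float = 30.0, min_tool_calls: int = 3,
                  keywords=("thinking", "analyzing")) -> bool:
    """Fire a TTS notification when any threshold is crossed
    (duration, tool-call count, or a thinking keyword in the transcript)."""
    if duration_s >= min_duration:
        return True
    if tool_calls >= min_tool_calls:
        return True
    text = transcript.lower()
    return any(k in text for k in keywords)
```

Any single criterion is enough to trigger, so short tasks stay silent while long or tool-heavy runs produce a notification.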

Architecture: scripts/tts-notify.py (single script, ~290 lines)
- Hook fires on Claude stop event
- Checks thresholds against transcript
- Summarizes via claude -p subprocess
- Speaks via say or mlx_audio subprocess
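The subprocess commands in that flow can be sketched as command-list builders (shown without executing them; the prompt wording and flag choices are assumptions, but `claude -p`, macOS `say`, and `python -m mlx_audio.tts.generate` are the invocations named above):

```python
from typing import List

def summarize_cmd(transcript: str) -> List[str]:
    """Claude CLI one-shot prompt mode; prompt text is illustrative."""
    return ["claude", "-p", f"Summarize in one sentence: {transcript}"]

def speak_cmd(text: str, backend: str = "say") -> List[str]:
    """Build the TTS command for the selected backend."""
    if backend == "say":
        return ["say", text]  # macOS built-in TTS (the default)
    # MLX voice-cloning path, invoked as a module subprocess
    return ["python", "-m", "mlx_audio.tts.generate", "--text", text]
```

Each command list would be handed to `subprocess.run`, which is why every MLX call pays the model-load cost noted below.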

Known limitation: each MLX TTS call loads the ~4 GB model from scratch (5-10 s of latency)
Next: Direct Python API integration with background daemon for sub-second response