Edge AI TTS Server

A real-time text-to-speech server with HTTP API and local audio playback.

Building

cmake -Bbuild -DCMAKE_BUILD_TYPE=Release -DCMAKE_INSTALL_PREFIX=$PWD/install
cmake --build build
cmake --install build

Requirements

CMake 3.26+
C++17 compiler
PortAudio (brew install portaudio on macOS, sudo apt install portaudio19-dev on Linux)

Running

./build/tts_server <model.onnx> <model.onnx.json> <espeak-ng-data>

The server listens on http://0.0.0.0:9999.

API Endpoints

Speak Text (with local playback)

# Speak immediately
curl -X POST http://localhost:9999/ -d "Hello world"

# Stream text (buffers until punctuation)
curl -X POST http://localhost:9999/stream -d "Hello, "
curl -X POST http://localhost:9999/stream -d "world."
curl -X POST http://localhost:9999/flush

# Cancel playback
curl -X POST http://localhost:9999/cancel

Synthesize to File

curl -X POST http://localhost:9999/synthesize -d "Hello world" -o output.raw

# Convert to MP3
ffmpeg -f f32le -ar 22050 -ac 1 -i output.raw output.mp3

Name		Name	Last commit message	Last commit date
Latest commit History 151 Commits
docs		docs
include		include
src		src
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
CMakeLists.txt		CMakeLists.txt
LICENSE		LICENSE
README.md		README.md
main.cpp		main.cpp

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Edge AI TTS Server

Building

Requirements

Running

API Endpoints

Speak Text (with local playback)

Synthesize to File

License

About

Uh oh!

Releases

Packages

Languages

License

RunEdgeAI/tts-server

Folders and files

Latest commit

History

Repository files navigation

Edge AI TTS Server

Building

Requirements

Running

API Endpoints

Speak Text (with local playback)

Synthesize to File

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages