AhaTTS - Kokoro OpenAI-API

AhaTTS is an open-source, production-ready text-to-speech service with an OpenAI-compatible API and a built-in web demo.

Highlights

OpenAI-compatible API for drop-in integration
Built-in api/web demo UI
CPU / CUDA / MPS support
Multi-language and multi-voice (Kokoro)
Streaming output with high-quality audio

Quickstart

./scripts/install.sh --device cpu   # or gpu / mac
./scripts/dev.sh

Visit:

OpenAI Speech API

Request URL: http(s)://<server-address>:<port>/v1/audio/speech (POST).

Parameters:

model (string, required): one of tts-1 or tts-1-hd.
input (string, required): text to generate audio from. Max length 4096 characters.
voice (string, required): one of alloy, ash, coral, echo, fable, onyx, nova, sage, shimmer.
response_format (string, optional): audio format, default mp3. Supported: mp3, opus, aac, flac.
speed (number, optional): audio speed, default 1.0. Range 0.5 to 2.0.
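
As an illustration of the parameters above, here is a minimal client sketch using only the Python standard library. The helper names `build_speech_payload` and `synthesize` are hypothetical and not part of AhaTTS. The sketch assumes the endpoint accepts a JSON body as the OpenAI Speech API does, and that no authentication header is required. Neither assumption is confirmed by this README.

```python
import json
import urllib.request

# Allowed values copied from the parameter list above.
MODELS = {"tts-1", "tts-1-hd"}
VOICES = {"alloy", "ash", "coral", "echo", "fable", "onyx", "nova", "sage", "shimmer"}
FORMATS = {"mp3", "opus", "aac", "flac"}
MAX_INPUT_CHARS = 4096


def build_speech_payload(text, voice="alloy", model="tts-1",
                         response_format="mp3", speed=1.0):
    """Check arguments against the documented limits and return the request body."""
    if model not in MODELS:
        raise ValueError(f"unsupported model: {model!r}")
    if voice not in VOICES:
        raise ValueError(f"unsupported voice: {voice!r}")
    if response_format not in FORMATS:
        raise ValueError(f"unsupported response_format: {response_format!r}")
    if not text or len(text) > MAX_INPUT_CHARS:
        raise ValueError("input must be 1 to 4096 characters long")
    if not 0.5 <= speed <= 2.0:
        raise ValueError("speed must be between 0.5 and 2.0")
    return {
        "model": model,
        "input": text,
        "voice": voice,
        "response_format": response_format,
        "speed": speed,
    }


def synthesize(base_url, text, out_path, **options):
    """POST the payload to /v1/audio/speech and save the returned audio bytes."""
    payload = build_speech_payload(text, **options)
    request = urllib.request.Request(
        base_url.rstrip("/") + "/v1/audio/speech",
        data=json.dumps(payload).encode("utf-8"),
        # Assumption: JSON body, no Authorization header needed.
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response, open(out_path, "wb") as audio_file:
        audio_file.write(response.read())
    return out_path
```

With a server running, a call might look like `synthesize("http://<server-address>:<port>", "Hello from AhaTTS.", "hello.mp3", voice="nova")`, substituting the real address and port. Validating on the client side is a design choice for this sketch; the server may apply its own checks.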

Acknowledgements

Special thanks to the following projects for inspiration and reference implementations:

Kokoro-FastAPI: https://github.com/remsky/Kokoro-FastAPI
kokoro: https://github.com/hexgrad/kokoro
