A Docker-based OpenAI-compatible Text-to-Speech API server powered by Kyutai's TTS models with GPU acceleration support.
-
Updated
Jul 12, 2025 - Python
A Docker-based OpenAI-compatible Text-to-Speech API server powered by Kyutai's TTS models with GPU acceleration support.
Kotai is a fully local, zero-cost voice assistant that combines the power of Kyutai TTS/STT, LiveKit, and local LLMs to create natural conversational experiences.
An automated installation script for deploying Kyutai's Moshi STT server on macOS Apple Silicon.
Demo repository for Kyutai Labs' STT-1B model: Real-time speech-to-text transcription with streaming inference, built-in VAD, and Jupyter notebook examples for audio processing and simulation.
A FastAPI-based Speech-to-Text service that provides OpenAI Whisper API compatibility using Kyutai's powerful STT models. This allows you to use any OpenAI Whisper client with Kyutai's models as a drop-in replacement.
LiveKit TTS plugin with Kyutai streaming implementation
Golang bindings to Kyutai Delayed Streams Modeling Rust productions servers
Working integration with Kyutai and the Omi app.
A high-performance, GPU-optimized real-time speech-to-text (STT) streaming server built with WebSocket support for multiple concurrent clients. This project leverages the Kyutai STT model and is optimized for NVIDIA RTX 4090 GPUs, providing low-latency transcription for audio streams.
Add a description, image, and links to the kyutai topic page so that developers can more easily learn about it.
To associate your repository with the kyutai topic, visit your repo's landing page and select "manage topics."