kyutai

moshi Public

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 9.7k 895

pocket-tts Public

A TTS that fits in your CPU (and pocket)

Python 3.4k 376

delayed-streams-modeling Public

Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.

Python 2.9k 296

hibiki Public

Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits for the end of the source utterance to start translating--- H…

Rust 1.4k 110

unmute Public

Make text LLMs listen and speak

Python 1.2k 205

moshi-finetune Public

Python 389 59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

kyutai

Popular repositories Loading

Repositories

Uh oh!

Uh oh!

People

Top languages

Uh oh!

Most used topics

Uh oh!