@kyutai-labs

kyutai

Kyutai - Open Science AI Lab

Popular repositories

  1. moshi (Public)

    Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

    Python · 9.7k stars · 895 forks

  2. pocket-tts (Public)

    A TTS that fits in your CPU (and pocket)

    Python · 3.4k stars · 374 forks

  3. delayed-streams-modeling (Public)

    Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.

    Python · 2.9k stars · 297 forks

  4. hibiki (Public)

    Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation, where one waits for the end of the source utterance to start translating, H…

    Rust · 1.4k stars · 110 forks

  5. unmute (Public)

    Make text LLMs listen and speak

    Python · 1.2k stars · 207 forks

  6. moshi-finetune (Public)

    Python · 387 stars · 60 forks

Repositories

Showing 10 of 26 repositories
