@kyutai-labs

kyutai

Kyutai - Open Science AI Lab

Popular repositories

  1. moshi Public

    Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

    Python · 9.7k stars · 895 forks

  2. pocket-tts Public

    A TTS that fits in your CPU (and pocket)

    Python · 3.4k stars · 374 forks

  3. delayed-streams-modeling Public

    Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.

    Python · 2.9k stars · 297 forks

  4. hibiki Public

    Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation, where one waits for the end of the source utterance to start translating, H…

    Rust · 1.4k stars · 110 forks

  5. unmute Public

    Make text LLMs listen and speak

    Python · 1.2k stars · 207 forks

  6. moshi-finetune Public

    Python · 387 stars · 60 forks

Repositories

Showing 10 of 26 repositories
  • unmute Public

    Make text LLMs listen and speak

    Python · 1,207 stars · MIT · 207 forks · 29 open issues (3 need help) · 1 open pull request · Updated Feb 26, 2026
  • invincible-voice Public

    To bring back voice to those who lost it

    TypeScript · 56 stars · MIT · 6 forks · 6 open issues (1 needs help) · 1 open pull request · Updated Feb 25, 2026
  • pocket-tts Public

    A TTS that fits in your CPU (and pocket)

    Python · 3,373 stars · MIT · 373 forks · 25 open issues (7 need help) · 3 open pull requests · Updated Feb 21, 2026
  • yomikomi Public

    A small rust-based data loader

    Rust · 36 stars · Apache-2.0 · 2 forks · 1 open issue · 1 open pull request · Updated Feb 20, 2026
  • hibiki-zero Public

    A real-time and multilingual speech translation model

    Python · 189 stars · MIT · 20 forks · 2 open issues · 0 open pull requests · Updated Feb 13, 2026
  • moshi Public

    Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

    Python · 9,725 stars · Apache-2.0 · 895 forks · 65 open issues · 14 open pull requests · Updated Feb 12, 2026
  • flashy Public

    Framework for writing deep learning training loops. Lightweight, and retaining full freedom to design as you see fit. It handles checkpointing, logging, distributed training, compatibility with Dora, and more!

    Python · 5 stars · MIT · 0 forks · 0 open issues · 0 open pull requests · Updated Feb 4, 2026
  • delayed-streams-modeling Public

    Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.

    Python · 2,865 stars · Apache-2.0 · 297 forks · 36 open issues · 0 open pull requests · Updated Jan 26, 2026
  • dora Public

    Dora is an experiment management framework. It expresses grid searches as pure Python files that live in your repo and identifies each experiment by a unique hash signature. Scale up to hundreds of experiments without losing your sanity.

    Python · 5 stars · MIT · 0 forks · 0 open issues · 0 open pull requests · Updated Jan 22, 2026
  • tts_longeval Public
    Python · 30 stars · MIT · 2 forks · 0 open issues · 0 open pull requests · Updated Jan 22, 2026
