Skip to content

Support Mimi model inside Moshi #8373

@iseeyuan

Description

@iseeyuan

🚀 The feature, motivation and pitch

Mimi is an audio codec from Kyutai. It works together with a Llama transformer for the end to end experience of speech dialogue. This issue is a milestone toward supporting Moshi and Hibiki models on devices (iPhone and Android).

Alternatives

No response

Additional context

No response

RFC (Optional)

No response

cc @mergennachin @cccclai @helunwencser @dvorjackz

Metadata

Metadata

Assignees

Labels

module: llmIssues related to LLM examples and apps, and to the extensions/llm/ codetriagedThis issue has been looked at a team member, and triaged and prioritized into an appropriate module

Type

No type

Projects

Status

Done

Status

Done

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions