Add PEFT and DeepSpeed dependencies; align Qwen2Model usage and TTS prompt with MiMo-Audio by candlewill · Pull Request #2 · XiaomiMiMo/MiMo-Audio-Training

candlewill · 2025-10-17T10:14:09Z

This PR introduces two key improvements for consistency and training efficiency:

Add peft and deepspeed to requirements.txt to support parameter-efficient fine-tuning and distributed training via DeepSpeed integration.
Remove is_causal=True from Qwen2Model forward call in modeling_mimo_audio.py to match the original MiMo-Audio implementation.
Update TTS user prompt for Chinese text from "Please convert this text to speech" to "请将这段文字转换为语音" in mimo_audio.py, ensuring better alignment with the base model's training data for improved synthesis quality.

…omiMiMo/MiMo-Audio/blob/62d956b4a1a45419bee5e41f477078c3684dbbcc/src/mimo_audio/modeling_mimo_audio.py#L400C12-L400C47 when calling Qwen2Model

candlewill added 2 commits October 17, 2025 10:12

add peft to requirements

754028e

update requirements.txt

cf21ee5

candlewill force-pushed the main branch from 7818e3e to cf21ee5 Compare October 20, 2025 02:04

keep is_causal the same to the MiMo-Audio, see https://github.com/Xia…

813cfd8

…omiMiMo/MiMo-Audio/blob/62d956b4a1a45419bee5e41f477078c3684dbbcc/src/mimo_audio/modeling_mimo_audio.py#L400C12-L400C47 when calling Qwen2Model

candlewill changed the title ~~add peft to requirements~~ Update dependencies and align Qwen2Model usage with MiMo-Audio Oct 20, 2025

fix speech_loss_weights missing error

aa1c009

candlewill changed the title ~~Update dependencies and align Qwen2Model usage with MiMo-Audio~~ Add PEFT and DeepSpeed dependencies; align Qwen2Model usage and TTS prompt with MiMo-Audio Oct 20, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add PEFT and DeepSpeed dependencies; align Qwen2Model usage and TTS prompt with MiMo-Audio#2

Add PEFT and DeepSpeed dependencies; align Qwen2Model usage and TTS prompt with MiMo-Audio#2
candlewill wants to merge 4 commits intoXiaomiMiMo:mainfrom
OpenT2S:main

candlewill commented Oct 17, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

candlewill commented Oct 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

candlewill commented Oct 17, 2025 •

edited

Loading