Skip to content

Add PEFT and DeepSpeed dependencies; align Qwen2Model usage and TTS prompt with MiMo-Audio#2

Open
candlewill wants to merge 4 commits intoXiaomiMiMo:mainfrom
OpenT2S:main
Open

Add PEFT and DeepSpeed dependencies; align Qwen2Model usage and TTS prompt with MiMo-Audio#2
candlewill wants to merge 4 commits intoXiaomiMiMo:mainfrom
OpenT2S:main

Conversation

@candlewill
Copy link

@candlewill candlewill commented Oct 17, 2025

This PR introduces two key improvements for consistency and training efficiency:

  • Add peft and deepspeed to requirements.txt to support parameter-efficient fine-tuning and distributed training via DeepSpeed integration.
  • Remove is_causal=True from Qwen2Model forward call in modeling_mimo_audio.py to match the original MiMo-Audio implementation.
  • Update TTS user prompt for Chinese text from "Please convert this text to speech" to "请将这段文字转换为语音" in mimo_audio.py, ensuring better alignment with the base model's training data for improved synthesis quality.

@candlewill candlewill changed the title add peft to requirements Update dependencies and align Qwen2Model usage with MiMo-Audio Oct 20, 2025
@candlewill candlewill changed the title Update dependencies and align Qwen2Model usage with MiMo-Audio Add PEFT and DeepSpeed dependencies; align Qwen2Model usage and TTS prompt with MiMo-Audio Oct 20, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant