- mindone.diffusers: Compatible with 🤗 diffusers v0.35.2, preview supports for sota v0.36 pipelines
- mindone.transformers: Compatible with 🤗 transformers v4.57.1
- ComfyUI: Added initial ComfyUI integration support
- MindSpore: Compatible with MindSpore 2.6.0 - 2.7.1
- Major upgrade: Enhanced compatibility with 🤗 transformers v4.54 and v4.57.1. Check supported models here.
-
Vision Models:
-
Audio/Speech Models:
-
Text/Language Models:
- Llama4 (#1470)
- Arcee (#1470)
- Falcon H1 (#1465)
- Dots1 (#1469)
- SmolLM3 (v4.54.1) (#1391)
- ModernBERT Decoder (v4.54.1) (#1397)
- Hunyuan V1 Dense/MoE (v4.57.1) (#1401)
- Evolla (v4.54.1) (#1440)
- EXAONE (#1396)
- Doge (#1392)
- ERNIE 4.5 & ERNIE 4.5 MoE (#1393)
- GLM4 MoE (#1409)
- Flex OLMo (#1442)
- T5Gemma (#1420)
- VaultGemma (#1450)
- BLT/Apertus/Ministral (#1462)
- EOMT/TimesFM (#1403)
- Seed OSS (#1441)
- xLSTM (#1466)
- d_fine, GraniteMoeHybrid, EfficientLoFTR Models (#1405)
-
Multimodal Models:
- Qwen3 Omni (#1411)
- Qwen3 Next (#1476)
- ColQwen2 (v4.54.1) (#1414)
- Cohere2 Vision (v4.57.1) (#1473)
- InternVL (v4.57) (#1463)
- Janus (v4.57) (#1463)
- Kosmos-2.5 (#1456)
- LFM2/LFM2-VL (#1456)
- MetaCLIP 2 (#1456)
- Mlcd (#1472)
- SAM2 (#1426)
- SAM2 Video Support (#1434)
- Olmo3 Model (#1467)
- DeepseekV2/DeepseekVL/DeepseekVLHybrid (#1477)
- MM Grounding DINO (#1486)
- Qwen2.5VL ImageProcessor Fast / VideoProcessor (#1429)
- Qwen3_VL Video Processor & Qwen2_VL Image Processor Fast (#1419)
- Phi4/Whisper/Ultravox/InternVL/Qwen2_audio/MiniCPMV/LLaVA-Next/LLaVA-Next-Video processors (#1471)
- Fixed some diffusers bugs (#1448)
- Added ComfyUI root files and CLI args (#1480)
- Added text encoder files (#1481)
- Updated clip_model.py (#1479)
- Added Wan2.2 LoRA finetune support (#1418)
- Updated Emu3 performance for MindSpore 2.6.0 and 2.7.0 (#1417)
- Updated HunyuanVideo-I2V to mindspore 2.6.0 and 2.7.0 (#1385)
- Add accelerated dit pipelines compatible with mindspore Graph Mode (#1433)
- Added Fb cache taylorseer graph mode implementation for Flux.1 (#1475)
- Fixed AIMv2/Arcee rely on torch bug (#1485)
- Fixed bugs of mindone.transformers models that rely on torch (#1482)
- Fixed Qwen2.5VLProcessor tokenizer converting tensor bug (#1483)
- Fixed Qwen3_VL text attention selection bug (#1455)
- Fixed GLM4.1V bs>1 generation index bug (#1437)
- Fixed training issue in TrainOneStepWrapper (#1408)
- Fixed import error if env contains accelerate module (#1431)
- ZeRO: Support training with MS 2.6.0 and 2.7.0 (#1383)
- Misc bugfixes (#1424)
- Docs updates for mindone v0.5.0 release, and ut fixes (#1484)
- Total commits: 374
- Files changed: 807
- Lines added: 156,792
- Lines deleted: 23,531