[bugfix] Fix Qwen3.5 LoRA merge export producing wrong state_dict keys#9057
Redamency wants to merge 1 commit into modelscope:main from
Conversation
(Fixes modelscope#9046.) In transformers>=5.5.0, `save_pretrained` calls `revert_weight_conversion`, which incorrectly applies weight-key renaming for composite models like Qwen3.5. Reverting the conversion mapping `^model.language_model -> model` re-prefixes keys such as `model.language_model.layers.X.*` (yielding `model.language_model.language_model.language_model.layers.X.*`) and turns `model.visual.*` into `model.language_model.visual.*`. Fix: pass `save_original_format=False` to `save_pretrained` to skip the buggy `revert_weight_conversion` step; the in-memory state_dict already has keys matching the model's safetensors format. A version check via `inspect.signature` keeps backward compatibility with older transformers versions that lack this parameter.
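The key corruption described above can be illustrated with a simplified, hypothetical one-pass reversal of the conversion mapping (this is not the transformers source; the real `revert_weight_conversion` is more involved and, per the report, can stack the prefix more than once):

```python
import re

# Reverting "^model.language_model -> model" means renaming keys that start
# with "model" back to "model.language_model". Keys that already carry the
# "model.language_model" prefix get it stacked again, and keys for sibling
# submodules (e.g. the visual tower) are wrongly moved under it.
reverse_pattern = re.compile(r"^model")
replacement = "model.language_model"

keys = [
    "model.language_model.layers.0.self_attn.q_proj.weight",
    "model.visual.blocks.0.attn.qkv.weight",
]
reverted = [reverse_pattern.sub(replacement, k) for k in keys]
for k in reverted:
    print(k)
# The first key gains an extra "language_model." prefix; the second is
# misplaced under "model.language_model".
```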
Contributor
Code Review
This pull request updates `save_checkpoint` in `swift/model/utils.py` to conditionally pass `save_original_format=False` to `save_pretrained` when the parameter is supported, preventing a weight-conversion bug in transformers>=5.5. One review note: the `import inspect` statement should be moved to the top of the file to follow PEP 8 and avoid re-importing the module on every function call.
```python
# that corrupts state_dict keys for composite models (e.g. Qwen3.5).
# See: https://github.com/modelscope/ms-swift/issues/9046
save_kwargs = {}
import inspect
```
Contributor
According to PEP 8 style guidelines, imports should be placed at the top of the file. Please move import inspect to the top-level imports of this module. This improves code organization and avoids re-importing the module on every function call.
References
- PEP 8 states that imports should be at the top of the file, just after any module comments and docstrings, and before module globals and constants. Placing imports inside functions is discouraged. (link)
Issue
Fixes #9046
Root Cause
When using transformers>=5.5.0, the `save_pretrained` method applies `revert_weight_conversion`, which uses the model's conversion mapping (e.g. `^model.language_model -> model`) in reverse during saving. For composite models like Qwen3.5 (which has a `visual` submodule), this causes state_dict keys to be incorrectly prefixed.
For example:
- `model.language_model.layers.X.*` becomes `model.language_model.language_model.language_model.layers.X.*`
- `model.visual.*` becomes `model.language_model.visual.*`

This makes the exported model unusable.
Fix
Pass `save_original_format=False` to `save_pretrained` to skip the buggy `revert_weight_conversion` step. The fix uses `inspect.signature` to check parameter availability, for backward compatibility with older transformers versions that lack the parameter.
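A minimal sketch of the guarded call, following the names in the PR description (the helper name `save_model_compat` is hypothetical; the real change lives inside `save_checkpoint` in `swift/model/utils.py`):

```python
import inspect


def save_model_compat(model, output_dir):
    """Save a model, skipping revert_weight_conversion where supported.

    save_original_format=False tells newer transformers to keep the
    in-memory state_dict keys as-is (they already match the model's
    safetensors format). The inspect.signature check keeps older
    transformers versions, which lack the parameter, working unchanged.
    """
    save_kwargs = {}
    params = inspect.signature(model.save_pretrained).parameters
    if "save_original_format" in params:
        save_kwargs["save_original_format"] = False
    model.save_pretrained(output_dir, **save_kwargs)
```

Probing the bound method's signature rather than comparing version strings means the guard keeps working even if the parameter is backported or renamed away in a future release.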
Testing
Verified with Qwen3.5-0.8B + LoRA merge export: