[v4] refactor ms-swift v4 by Jintao-Huang · Pull Request #7238 · modelscope/ms-swift

Jintao-Huang · 2025-12-29T09:26:53Z

No description provided.

gemini-code-assist · 2025-12-29T09:27:28Z

Summary of Changes

Hello @Jintao-Huang, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request significantly refactors the model registration system by enhancing the flexibility of template assignments. It shifts the definition of the template parameter from the broader ModelMeta to individual ModelGroup instances, enabling more precise control over model variants. This change also involves a substantial consolidation of specific model types into more general categories, simplifying the overall model constant definitions and updating all affected model registration calls and documentation accordingly.

Highlights

Refactored Model Template Definition: The template parameter, previously a required attribute of ModelMeta, has been moved to an optional attribute within ModelGroup. This allows for more granular template specification for different model variants within the same overall model type.
Consolidation of Model Types: Numerous specific LLMModelType and MLLMModelType entries (e.g., qwen2_5_math, qwq, ernie_thinking, yi_coder) have been removed from swift/llm/model/constant.py. These specific model variants are now handled more flexibly by specifying their template directly within their respective ModelGroup definitions, leading to a cleaner and more streamlined model registration system.
Updated Model Registration Calls: Across various model definition files, the template argument in register_model calls has been changed from a positional argument to an explicit keyword argument (template=TemplateType.xxx or template='custom'). Additionally, many ModelGroup definitions now directly embed their TemplateType.
Documentation and Type System Updates: Documentation files have been updated to reflect that the template parameter for ModelMeta is no longer required and defaults to None. Assertions have also been added in ModelGroup and ModelMeta for type checking related to the template and get_function attributes.
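
To make the shift from a ModelMeta-level template to a ModelGroup-level template concrete, here is a minimal sketch. It does not reproduce the ms-swift classes: the dataclasses, the resolve_template helper, and the rule "group template first, then the ModelMeta template" are assumptions made for illustration, and the template strings are placeholders.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Model:
    ms_model_id: str
    hf_model_id: Optional[str] = None


@dataclass
class ModelGroup:
    models: List[Model]
    template: Optional[str] = None  # optional group-level template

    def __post_init__(self):
        # Type assertion in the spirit of the one the summary mentions.
        assert self.template is None or isinstance(self.template, str), self.template


@dataclass
class ModelMeta:
    model_type: str
    model_groups: List[ModelGroup]
    template: Optional[str] = None  # no longer required; defaults to None

    def resolve_template(self, model_id: str) -> Optional[str]:
        # Hypothetical helper: prefer the template of the group that owns the
        # model, otherwise fall back to the ModelMeta-level template.
        for group in self.model_groups:
            for model in group.models:
                if model_id in (model.ms_model_id, model.hf_model_id):
                    return group.template or self.template
        return self.template


# One model type, two groups; only the second group overrides the template.
meta = ModelMeta(
    'qwen2',
    [
        ModelGroup([Model('Qwen/Qwen2-7B-Instruct', 'Qwen/Qwen2-7B-Instruct')]),
        ModelGroup([Model('AIDC-AI/Marco-o1', 'AIDC-AI/Marco-o1')], 'marco_o1'),
    ],
    template='qwen',
)
print(meta.resolve_template('AIDC-AI/Marco-o1'))
print(meta.resolve_template('Qwen/Qwen2-7B-Instruct'))
```

With this layout a variant such as Marco-o1 no longer needs its own model type; it only needs a group that names a different template.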

¹ Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution.

gemini-code-assist

Code Review

This pull request introduces a significant and well-executed refactoring of the model registration mechanism. By making the template parameter optional in ModelMeta and introducing it in ModelGroup, you've successfully consolidated numerous model registrations, which greatly simplifies the codebase and improves maintainability. The changes are consistently applied across all relevant files, including documentation. I have a couple of minor suggestions for further cleanup, but overall, this is a great improvement.

gemini-code-assist · 2025-12-29T09:30:03Z

swift/model/constant.py

@@ -8,19 +8,10 @@ class LLMModelType:
    qwen = 'qwen'
    qwen2 = 'qwen2'
    qwen2_5 = 'qwen2_5'


The model_type qwen2_5 is kept here, but its registration in swift/llm/model/model/qwen.py has been removed. The models previously under this type are now merged into the qwen2 model registration, using qwen2_5 as a template type. To maintain consistency and avoid potential confusion, it would be best to remove qwen2_5 from LLMModelType as it's no longer a registered model type.
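
A check like the following could catch this kind of leftover constant automatically. It is only a sketch: the REGISTERED set is a hypothetical stand-in for whatever the real registry exposes, and unregistered_constants is an illustrative helper, not part of ms-swift.

```python
class LLMModelType:
    qwen = 'qwen'
    qwen2 = 'qwen2'
    qwen2_5 = 'qwen2_5'


# Hypothetical stand-in for the model types that were actually registered.
REGISTERED = {'qwen', 'qwen2'}


def unregistered_constants(constants_cls, registered):
    """Return constant values declared on the class but missing from the registry."""
    declared = {
        value
        for name, value in vars(constants_cls).items()
        if not name.startswith('_') and isinstance(value, str)
    }
    return sorted(declared - set(registered))


stale = unregistered_constants(LLMModelType, REGISTERED)
print(stale)  # ['qwen2_5']
```

Run as a unit test, an empty result would mean every declared model type still has a matching registration.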

gemini-code-assist · 2025-12-29T09:30:03Z

swift/model/models/qwen.py

                tags=['math']),
+            ModelGroup([Model('AIDC-AI/Marco-o1', 'AIDC-AI/Marco-o1')], TemplateType.marco_o1),


There's an extra space at the beginning of this line, which makes the indentation inconsistent with the surrounding ModelGroup definitions. Please remove the leading space for consistency.

Suggested change

ModelGroup([Model('AIDC-AI/Marco-o1', 'AIDC-AI/Marco-o1')], TemplateType.marco_o1),

Jintao-Huang · 2026-01-13T07:04:24Z

/gemini review

gemini-code-assist

Code Review

This pull request introduces a major refactoring for the v4 release of ms-swift. The changes are extensive and well-executed, significantly improving the project's structure and maintainability. Key changes include a reorganization of the package structure, with the swift.llm and swift.plugin modules being broken down into more focused sub-packages like swift.model, swift.template, swift.dataset, swift.infer_engine, and swift.arguments. This is accompanied by several API changes, such as renaming get_model_tokenizer to get_model_processor, PtEngine to TransformersEngine, and TrainArguments to SftArguments. The model loading mechanism has also been refactored to use a more object-oriented ModelLoader approach. All documentation and examples have been diligently updated to reflect these new APIs. The refactoring appears consistent and thoughtful, and I found no issues or bugs in the implementation. This is a great step forward for the library.
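
For downstream code, the three renames named in this review can be applied mechanically. The sketch below is an assumption about how one might do that, not a migration tool shipped with ms-swift. It rewrites whole identifiers only, it does not touch import paths, and it also rewrites matches inside strings and comments, so the result should be reviewed by hand.

```python
import re

# Old name -> new name, taken from the review text above.
RENAMES = {
    'get_model_tokenizer': 'get_model_processor',
    'PtEngine': 'TransformersEngine',
    'TrainArguments': 'SftArguments',
}

# \b keeps the match to whole identifiers, so 'MyPtEngine' is left alone.
_PATTERN = re.compile(r'\b(' + '|'.join(map(re.escape, RENAMES)) + r')\b')


def migrate_identifiers(source: str) -> str:
    """Replace each old identifier in `source` with its v4 name."""
    return _PATTERN.sub(lambda match: RENAMES[match.group(1)], source)


old = "engine = PtEngine(model_id)\nmodel, tok = get_model_tokenizer(model_id)\n"
new = migrate_identifiers(old)
print(new)
```

Because the package layout changed as well, import statements such as those pointing at swift.llm still need to be updated separately.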

Jintao-Huang · 2026-01-13T13:10:27Z

/gemini review

gemini-code-assist

Code Review

This pull request introduces a major refactoring of the ms-swift library for version 4.0. The swift.llm module has been broken down into several more specific and well-organized packages such as swift.model, swift.template, swift.dataset, swift.infer_engine, and swift.pipelines. This significantly improves the project structure and maintainability. Many classes and functions have been renamed for clarity and consistency, such as PtEngine to TransformersEngine and get_model_tokenizer to get_model_processor. The documentation and examples have been updated accordingly to reflect these changes. Overall, this is an excellent and well-executed refactoring effort. I've found a couple of minor issues in the English documentation where warning messages were not translated.
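
Code that has to run against both the old and the new layout could resolve names at import time. The helper below is a generic sketch built on importlib. The candidate locations listed for the engine class are assumptions inferred from the package names in this review and have not been verified against the merged code, so the demo at the end uses only standard-library modules.

```python
import importlib
from typing import Any, Iterable, Tuple


def import_first(candidates: Iterable[Tuple[str, str]]) -> Any:
    """Return the first attribute that can be imported from (module, attr) pairs."""
    errors = []
    for module_name, attr in candidates:
        try:
            module = importlib.import_module(module_name)
            return getattr(module, attr)
        except (ImportError, AttributeError) as exc:
            errors.append(f'{module_name}.{attr}: {exc}')
    raise ImportError('no candidate could be imported: ' + '; '.join(errors))


# Assumed locations, newest first; adjust to the layout that is actually installed.
ENGINE_CANDIDATES = [
    ('swift.infer_engine', 'TransformersEngine'),
    ('swift.llm', 'PtEngine'),
]

# Demo with the standard library so the sketch runs without ms-swift installed.
dumps = import_first([('no_such_module_for_demo', 'x'), ('json', 'dumps')])
print(dumps({'a': 1}))
```

Calling import_first(ENGINE_CANDIDATES) would then return whichever engine class the installed version provides, or raise ImportError listing every failed attempt.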

update

bab8cd1

Jintao-Huang added 2 commits December 29, 2025 17:28

Merge branch 'main' into refactor_model_type_template

a5b0192

lint pass

6c3799f

gemini-code-assist bot reviewed Dec 29, 2025

Jintao-Huang added 23 commits December 29, 2025 19:06

update

7356963

update

4518360

update

cb779de

update

8eb5e8a

Merge branch 'main' into refactor_model_type_template

1ab87b1

update

208536d

update

af8cef7

Merge remote-tracking branch 'refs/remotes/origin/refactor_model_type_template' into refactor_model_type_template

c17c9e5

Merge branch 'main' into refactor_model_type_template

999529c

update

d475287

update

c497f81

update

bc901d0

update

86f49a2

update

6b2cfc4

update

a7d1e47

Merge branch 'main' into refactor_model_type_template

2b7ad87

update

58e3c5f

Merge branch 'main' into refactor_model_type_template

78a0397

fix

c82e54e

update

ccc11f4

update

7996307

update

0bb3103

update

0da5034

Jintao-Huang changed the title from "[v4] refactor model_type template" to "[v4] refactor ms-swift v4" on Jan 4, 2026

Jintao-Huang marked this pull request as draft January 4, 2026 07:09

Jintao-Huang added 4 commits January 13, 2026 14:25

fix grpo

6bba51a

fix

16456ca

fix

f1a995a

fix gkd megatron

5da7607

gemini-code-assist bot reviewed Jan 13, 2026

Jintao-Huang added 12 commits January 13, 2026 15:09

fix

2aeec3f

fix

6994256

Merge remote-tracking branch 'refs/remotes/origin/refactor_model_type_template' into refactor_model_type_template

9947f1a

fix torch_dtype

b3978ff

fix

5aad7f2

update

e4710d6

fix max_model_len

75b56bf

fix

14e9eb4

fix

d0a6c46

fix

ff705ea

update

adc4184

update

2a81ec3

gemini-code-assist bot reviewed Jan 13, 2026

Jintao-Huang added 8 commits January 13, 2026 21:40

fix unsloth

448962e

fix

49eeef2

fix grpo

bc7276c

fix

eada51b

Merge branch 'main' into refactor_model_type_template

8c5be30

fix model

f85f0c9

fix

98a2be2

Merge branch 'main' into refactor_model_type_template

0171487

Jintao-Huang merged commit 67ae607 into modelscope:main Jan 14, 2026
2 of 3 checks passed

meichangsu1 pushed a commit to tpx818/ms-swift that referenced this pull request Jan 22, 2026

[v4] refactor ms-swift v4 (modelscope#7238)

b7f9bc0

Jintao-Huang merged 146 commits into modelscope:main from Jintao-Huang:refactor_model_type_template

3 participants
