[Refactor] lora: reuse load_weights packed mapping #991
dongbo910220 wants to merge 6 commits into vllm-project:main
Conversation
Pull request overview
This refactor centralizes the packed→sublayer mapping for diffusion LoRA on each model’s load_weights() stacked_params_mapping, removing duplicated mapping definitions and making LoRA behavior consistent with the weight-loading path.
Changes:
- Remove diffusion transformers' class-level `packed_modules_mapping` and expose their existing `stacked_params_mapping` via `self.stacked_params_mapping` inside `load_weights()`.
- Update `DiffusionLoRAManager` to derive packed→sublayer mappings from `stacked_params_mapping`, and adjust `_expand_expected_modules_for_packed_layers` documentation and logging accordingly.
- Update LoRA manager tests to use `stacked_params_mapping`-based behavior and verify that replacement/activation logic for packed layers still works as intended.
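The derivation described above can be sketched as follows. The tuple format mirrors vLLM's `load_weights()` convention of `(packed_param, shard_param, shard_id)` triples; the helper itself is an illustrative placeholder, not the PR's actual implementation.

```python
# Hypothetical sketch: derive a {packed_module: [sublayer, ...]} mapping
# from a load_weights()-style stacked_params_mapping. Names here are
# illustrative, not the PR's real code.

def derive_packed_modules_mapping(stacked_params_mapping):
    """Build {packed_module: [sublayer, ...]} from (packed, shard, id) triples."""
    packed = {}
    for packed_name, shard_name, _shard_id in stacked_params_mapping:
        sublayers = packed.setdefault(packed_name.strip("."), [])
        shard = shard_name.strip(".")
        if shard not in sublayers:  # guard against duplicate entries
            sublayers.append(shard)
    return packed

# Example: a fused QKV projection split into three sublayers.
mapping = [
    (".qkv_proj", ".q_proj", "q"),
    (".qkv_proj", ".k_proj", "k"),
    (".qkv_proj", ".v_proj", "v"),
]
print(derive_packed_modules_mapping(mapping))
# → {'qkv_proj': ['q_proj', 'k_proj', 'v_proj']}
```

Because both the weight loader and the LoRA manager would read from the same triples, the two paths cannot drift apart, which is the consistency property the refactor is after.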
Reviewed changes
Copilot reviewed 11 out of 11 changed files in this pull request and generated no comments.
| File | Description |
|---|---|
| `vllm_omni/diffusion/models/z_image/z_image_transformer.py` | Removes `packed_modules_mapping` and exposes `stacked_params_mapping` from `load_weights()` for Z-Image. |
| `vllm_omni/diffusion/models/wan2_2/wan2_2_transformer.py` | Same refactor for the Wan 3D transformer; `load_weights()` now assigns `self.stacked_params_mapping`. |
| `vllm_omni/diffusion/models/sd3/sd3_transformer.py` | Same refactor for the SD3 transformer, covering both self- and cross-attention fused projections. |
| `vllm_omni/diffusion/models/qwen_image/qwen_image_transformer.py` | Same refactor for the Qwen Image transformer, including cross-attention and `.to_out` handling. |
| `vllm_omni/diffusion/models/ovis_image/ovis_image_transformer.py` | Same refactor for the Ovis Image transformer, exposing packed self-/cross-attention mappings. |
| `vllm_omni/diffusion/models/longcat_image/longcat_image_transformer.py` | Same refactor for the LongCat Image transformer, including FFN/context-FFN name remapping. |
| `vllm_omni/diffusion/models/glm_image/glm_image_transformer.py` | Same refactor for the GLM Image transformer, exposing fused QKV mappings. |
| `vllm_omni/diffusion/models/flux2_klein/flux2_klein_transformer.py` | Same refactor for the Flux2 Klein transformer, covering fused QKV and `add_kv_proj` mappings. |
| `vllm_omni/diffusion/lora/utils.py` | Updates the docstring to state that the packed→sublayer mapping is derived from each model's `stacked_params_mapping`. |
| `vllm_omni/diffusion/lora/manager.py` | Changes `_compute_packed_modules_mapping` to derive mappings from `stacked_params_mapping` (with validation and conflict warnings) and tweaks related warning messages. |
| `tests/diffusion/lora/test_lora_manager.py` | Adjusts tests to use `stacked_params_mapping` on the pipeline and aligns comments with the new mapping source while preserving behavior checks. |
Signed-off-by: dongbo910220 <1275604947@qq.com>
Force-pushed from a0d52aa to 517eb73.
@dongbo910220 Please resolve conflicts. Thanks!
@Gaohan123 Resolved. Thanks!
@Gaohan123, ROCm CI is still on vLLM 0.15.0 (see `docker/Dockerfile.rocm`), so `@config(config=...)` fails with `TypeError: config() got an unexpected keyword argument 'config'`. This is an environment/version mismatch, not a logic regression. The project docs explicitly recommend vLLM 0.16.0 (`docs/getting_started/quickstart.md`). Could we upgrade the ROCm CI image to vLLM ≥ 0.16 (or reinstall vLLM in ROCm CI), or should we add a backward-compatibility shim?
@dongbo910220 Hey, reusing each model's `load_weights()` `stacked_params_mapping` as the single source of truth for the LoRA packed→sublayer mapping is a clean refactor — it avoids divergent mappings. Tests are passing too. Is this ready for merge?
@lishunyang12 Thanks! Yes, it’s ready to merge. Local tests are passing; the remaining CI issue is a ROCm environment mismatch (vLLM 0.15.x) and is not caused by this change.
lishunyang12 left a comment:
Left a couple of comments. The main concern is a regression in z_image's QKV mapping.
Purpose

Based on @ZJY0516's feedback, reuse each diffusion model’s `load_weights()` `stacked_params_mapping` as the single source of truth for the packed→sublayer mapping used by LoRA.

Test Plan

`pytest -q tests/diffusion/lora`

Test Result

Passed.
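The "single source of truth" pattern from the purpose statement can be sketched as below: a model's `load_weights()` defines its stacked-params mapping once and exposes it on `self`, so the LoRA manager can reuse the same definition. The class and loading loop are simplified placeholders mirroring the PR description, not the actual transformer code.

```python
# Illustrative-only sketch of exposing stacked_params_mapping from
# load_weights(). ToyTransformer is a hypothetical stand-in for the
# diffusion transformers touched by this PR.

class ToyTransformer:
    def load_weights(self, weights):
        # (packed_param, shard_param, shard_id) triples in vLLM's convention.
        stacked_params_mapping = [
            (".qkv_proj", ".q_proj", "q"),
            (".qkv_proj", ".k_proj", "k"),
            (".qkv_proj", ".v_proj", "v"),
        ]
        # Expose the mapping so LoRA code can derive packed→sublayer info
        # from the same definition the weight loader uses.
        self.stacked_params_mapping = stacked_params_mapping
        for name, tensor in weights:
            ...  # normal stacked-weight loading logic would go here
```

With this in place, nothing needs a class-level `packed_modules_mapping`: the LoRA manager simply reads `model.stacked_params_mapping` after weights are loaded.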