Skip to content

Fix qwen3 vl moe execution failure#992

Open
shepark wants to merge 4 commits intovllm-project:releases/v0.15.1from
shepark:shepark/fix_qwen3_vl_moe_failure
Open

Fix qwen3 vl moe execution failure#992
shepark wants to merge 4 commits intovllm-project:releases/v0.15.1from
shepark:shepark/fix_qwen3_vl_moe_failure

Conversation

@shepark
Copy link
Contributor

@shepark shepark commented Feb 19, 2026

These are the changes for the qwen3 vl moe model corresponding to 3a7195e
and to the upstream (v0.15.1) change (vllm-project/vllm@8edaf38)

@shepark shepark marked this pull request as draft February 19, 2026 01:23
@shepark shepark force-pushed the shepark/fix_qwen3_vl_moe_failure branch from 4198669 to 6dc9601 Compare February 19, 2026 03:37
@github-actions
Copy link

🚧 CI Blocked

The main CI workflow was not started for the following reason:

This is a Draft PR. Please mark it as 'Ready for Review' to trigger the CI.

@shepark shepark force-pushed the shepark/fix_qwen3_vl_moe_failure branch from 6dc9601 to 5afa931 Compare February 20, 2026 00:54
@github-actions
Copy link

🚧 CI Blocked

The main CI workflow was not started for the following reason:

This is a Draft PR. Please mark it as 'Ready for Review' to trigger the CI.

@shepark shepark marked this pull request as ready for review February 20, 2026 00:57
@shepark shepark force-pushed the shepark/fix_qwen3_vl_moe_failure branch from 5afa931 to 1846288 Compare February 20, 2026 01:01
# 1st changes: overide only forward func to support 3d tensor input
# convert (B, S, H) -> (B*S, H)
# 2nd changes: corresponding changes to upstream (8edaf385)
# SharedFusedMoE support
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

NIT: This dev comment looks PR specific, can we make it more universal and as a docstring?
class HpuQwen3MoeSparseMoeBlock(UpstreamQwen3MoeSparseMoeBlock): """Override forward to handle 3D tensor input (B,S,H) -> (B*S,H) and SharedFusedMoE tuple returns."""

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

NIT: This dev comment looks PR specific, can we make it more universal and as a docstring? class HpuQwen3MoeSparseMoeBlock(UpstreamQwen3MoeSparseMoeBlock): """Override forward to handle 3D tensor input (B,S,H) -> (B*S,H) and SharedFusedMoE tuple returns."""

Thank you for the review. I will update this.

Copy link
Collaborator

@adobrzyn adobrzyn left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@shepark shepark force-pushed the shepark/fix_qwen3_vl_moe_failure branch from 1846288 to fb61400 Compare February 20, 2026 15:25
Signed-off-by: Seunghyuk Park <separk@habana.ai>
@shepark shepark force-pushed the shepark/fix_qwen3_vl_moe_failure branch from fb61400 to 3c68bf1 Compare February 20, 2026 15:35
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants