[Feature Request] Qwen3VL GRPO, SFT training #1256

@eagle705

Description

Additional context

Our customer would like to apply RL methods (GRPO, GSPO, and SPO) to MoE-based VLMs such as Qwen3-VL.

Would it be possible to extend the current VLM support to Qwen3-VL?

(cc @terrykong, @snowmanwwg)
