Support training Qwen2.5-VL-32B EAGLE3 model #437
Merged
jiapingW merged 3 commits into sgl-project:main, Jan 20, 2026
Conversation
Motivation
SpecForge currently supports Qwen2.5-VL-7B via Transformers. Rather than extending that model-specific Transformers integration, we should add systematic VL support through sglang.
Modifications
Dataset preprocessing: Use transformers to generate pixel_values and image_grid_thw (already available).
Request packing (SGLang): Wrap the data into an sglang Request. When unpacking into per-request chunks, split pixel_values at offsets computed from image_grid_thw, and keep the result compatible with mRoPE.
Draft model: During forward(), align mRoPE behavior with the target model.
Initialize mmCache: Set up the multimodal cache during initialization.
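The request-unpacking step above can be sketched roughly as follows. This is a minimal illustration, not the PR's actual code: it assumes the Qwen2.5-VL convention that pixel_values is a packed (total_patches, patch_dim) array and image_grid_thw is (num_images, 3) holding [t, h, w] grid sizes, so each image contributes t * h * w patch rows. The function name and the patch dimension 1176 are assumptions for the example.

```python
# Hypothetical sketch: splitting packed pixel_values into per-image
# chunks using image_grid_thw, as described in the modification list.
import numpy as np

def split_pixel_values(pixel_values: np.ndarray,
                       image_grid_thw: np.ndarray) -> list:
    """Segment packed patch rows into one array per image.

    Each image occupies t * h * w consecutive rows, so the split
    offsets are cumulative sums of those products.
    """
    patches_per_image = np.prod(image_grid_thw, axis=1)  # (num_images,)
    offsets = np.cumsum(patches_per_image)[:-1]          # interior split points
    return np.split(pixel_values, offsets, axis=0)

# Toy example: two images with grids (1, 2, 2) and (1, 4, 2),
# i.e. 4 and 8 patch rows in the packed tensor.
grid = np.array([[1, 2, 2], [1, 4, 2]])
packed = np.zeros((12, 1176))  # 1176 = assumed patch feature dim
chunks = split_pixel_values(packed, grid)
# chunks[0] has 4 rows, chunks[1] has 8 rows
```

In the real integration the per-image chunks would then be attached to each sglang Request, with mRoPE positions derived from the same image_grid_thw entries.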
Related Issues
Fixes #403
Accuracy Test
TODO
Benchmark & Profiling
Checklist