[Refactor] Simplify dummy data generation by DarkLight1337 · Pull Request #35025 · vllm-project/vllm

DarkLight1337 · 2026-02-21T14:40:15Z

Purpose

Always pass the kwargs from the multimodal (MM) config when constructing the HF processor. This removes the need to pass them explicitly to dummy data generation (see "[Multimodal] Expose mm_processor_kwargs for DummyInputsBuilder", #34330).
Always pass a dictionary as mm_options, so implementations no longer need to handle the case where it is None.
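
A minimal sketch of the two ideas above, not the code in this PR. The helper names (merge_processor_kwargs, normalize_mm_options, ToyDummyInputsBuilder) are hypothetical and are not vLLM APIs. The rule that call-site kwargs override config kwargs is also an assumption made for illustration.

```python
from __future__ import annotations

from typing import Any, Mapping


def merge_processor_kwargs(
    config_kwargs: Mapping[str, Any] | None,
    call_kwargs: Mapping[str, Any] | None = None,
) -> dict[str, Any]:
    """Combine config-level kwargs with per-call kwargs.

    Assumption for this sketch: per-call values win on key conflicts.
    """
    merged = dict(config_kwargs or {})
    merged.update(call_kwargs or {})
    return merged


def normalize_mm_options(mm_options: Mapping[str, Any] | None) -> dict[str, Any]:
    """Return a dictionary even when the caller supplied None."""
    return dict(mm_options or {})


class ToyDummyInputsBuilder:
    """Hypothetical builder that can rely on mm_options being a mapping."""

    def get_dummy_mm_data(
        self,
        mm_counts: Mapping[str, int],
        mm_options: Mapping[str, Any],
    ) -> dict[str, dict[str, Any]]:
        # No "if mm_options is None" branch: the caller normalizes first.
        return {
            modality: {"count": count, "overrides": mm_options.get(modality, {})}
            for modality, count in mm_counts.items()
        }


if __name__ == "__main__":
    kwargs = merge_processor_kwargs({"fps": 2}, {"fps": 4, "size": 224})
    print(kwargs)

    builder = ToyDummyInputsBuilder()
    print(builder.get_dummy_mm_data({"video": 1}, normalize_mm_options(None)))
```

Doing the merge and the None check once at the call site means each model-specific builder only has to handle the common case.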

Test Plan

Using the same test as #34330

Test Result

(Worker_TP2 pid=1233399) INFO 02-21 15:00:08 [gpu_model_runner.py:5150] Encoder cache will be initialized with a budget of 114688 tokens, and profiled with 1 video items of the maximum feature size.
(Worker_TP3 pid=1233400) INFO 02-21 15:00:08 [gpu_model_runner.py:5150] Encoder cache will be initialized with a budget of 114688 tokens, and profiled with 1 video items of the maximum feature size.
(Worker_TP1 pid=1233398) INFO 02-21 15:00:08 [gpu_model_runner.py:5150] Encoder cache will be initialized with a budget of 114688 tokens, and profiled with 1 video items of the maximum feature size.
(Worker_TP0 pid=1233397) INFO 02-21 15:00:08 [gpu_model_runner.py:5150] Encoder cache will be initialized with a budget of 114688 tokens, and profiled with 1 video items of the maximum feature size.


Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

dosubot · 2026-02-21T14:40:24Z

Related Documentation

Checked 0 published document(s) in 1 knowledge base(s). No updates required.

mergify · 2026-02-21T14:40:51Z

Documentation preview: https://vllm--35025.org.readthedocs.build/en/35025/

gemini-code-assist

Code Review

This pull request refactors the dummy data generation for multimodal models by centralizing the merging of keyword arguments from the multimodal configuration. This simplifies the get_dummy_mm_data and get_hf_processor methods across numerous model implementations. While the refactor is comprehensive, there are a few inconsistencies in the updated method signatures and missing keyword argument support in some custom processor adapters that could lead to runtime errors.

vllm/model_executor/models/kimi_vl.py

vllm/model_executor/models/transformers/multimodal.py

vllm/model_executor/models/isaac.py
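
The class of runtime error the review warns about can be illustrated with a small sketch. It does not reproduce the code in the files listed above. StrictAdapter, FlexibleAdapter, and build_processor are hypothetical names used only to show why a processor adapter that does not accept keyword arguments fails once config kwargs are always forwarded.

```python
from typing import Any, Mapping


class StrictAdapter:
    """Hypothetical adapter whose constructor accepts no extra kwargs."""

    def __init__(self, tokenizer: str) -> None:
        self.tokenizer = tokenizer


class FlexibleAdapter:
    """Hypothetical adapter that accepts and stores extra kwargs."""

    def __init__(self, tokenizer: str, **kwargs: Any) -> None:
        self.tokenizer = tokenizer
        self.kwargs = dict(kwargs)


def build_processor(adapter_cls: type, tokenizer: str, config_kwargs: Mapping[str, Any]):
    # Config kwargs are always forwarded, so the adapter must accept them.
    return adapter_cls(tokenizer, **config_kwargs)


if __name__ == "__main__":
    config_kwargs = {"num_frames": 8}

    ok = build_processor(FlexibleAdapter, "tok", config_kwargs)
    print(ok.kwargs)

    try:
        build_processor(StrictAdapter, "tok", config_kwargs)
    except TypeError as exc:
        print(f"StrictAdapter rejected the kwargs: {exc}")
```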

[Refactor] Simplify dummy data generation

35a5556

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

DarkLight1337 requested a review from Isotr0py February 21, 2026 14:40

DarkLight1337 added the "ready" label (ONLY add when PR is ready to merge/full CI is needed) Feb 21, 2026

DarkLight1337 requested review from NickLucche, ProExpertProg, WoosukKwon, hmellor, houseroad, mgoin, patrickvonplaten, robertgshaw2-redhat, sighingnow, tjtanaa, tlrmchlsmth, yewentao256, youkaichao and ywang96 as code owners February 21, 2026 14:40

mergify bot added labels Feb 21, 2026: documentation (Improvements or additions to documentation), deepseek (Related to DeepSeek models), llama (Related to Llama models), multi-modality (Related to multi-modality (#4194)), qwen (Related to Qwen models), speculative-decoding

Fix

9555e29

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

gemini-code-assist bot reviewed Feb 21, 2026

vllm/model_executor/models/kimi_vl.py (resolved)

vllm/model_executor/models/transformers/multimodal.py (resolved)

vllm/model_executor/models/isaac.py (resolved)

DarkLight1337 added 3 commits February 21, 2026 14:58

Fix

c554242

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

Fix

7a66ffd

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

Fix

5bcf7b0

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

Fix

eb2dd45

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

Isotr0py approved these changes Feb 21, 2026

DarkLight1337 added 2 commits February 21, 2026 16:11

Fix

08861bc

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

Fix

1e4bbd6

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

DarkLight1337 wants to merge 8 commits into vllm-project:main from DarkLight1337:simplify-dummy-data
