[ModelRunner][MultiModal] Remove legacy input mapper/processor from V0 #951
Merged
Conversation
wangxiyuan approved these changes on May 28, 2025.
Force-pushed from ec5ae94 to b431db1.
I have rebased on the latest main and nothing changed.
Signed-off-by: shen-shanshan <[email protected]>
@wangxiyuan The CI of this PR has passed.
wangxiyuan approved these changes on Jun 3, 2025.
momo609 pushed a commit to momo609/vllm-ascend that referenced this pull request on Jun 3, 2025:
[ModelRunner][MultiModal] Remove legacy input mapper/processor from V0 (vllm-project#951)
David9857 pushed a commit to David9857/vllm-ascend that referenced this pull request on Jun 3, 2025:
[ModelRunner][MultiModal] Remove legacy input mapper/processor from V0 (vllm-project#951)
momo609 pushed a commit to momo609/vllm-ascend that referenced this pull request on Jun 4, 2025:
[ModelRunner][MultiModal] Remove legacy input mapper/processor from V0 (vllm-project#951)
What this PR does / why we need it?
Remove legacy input mapper/processor from V0.
Find more details at #673 and vllm-project/vllm#15686.
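For readers who want a quick local check without standing up a server, the sketch below uses vLLM's public `LLM.chat()` API with the same model and image as the test section further down. It is illustrative only and not part of this PR or its CI; it assumes vLLM with the vllm-ascend plugin is installed and the Qwen2.5-VL weights are available, and the sampling settings are arbitrary.

```python
# Illustrative offline sanity check (assumption: vLLM + vllm-ascend installed
# and the model weights available). Sends an OpenAI-style multimodal message
# through the public LLM.chat() API, which exercises the merged multimodal
# processing path now that the legacy V0 input mapper/processor is gone.
from vllm import LLM, SamplingParams

llm = LLM(model="Qwen/Qwen2.5-VL-7B-Instruct",
          dtype="bfloat16",
          max_model_len=32768)

messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": [
        {"type": "image_url",
         "image_url": {"url": "https://modelscope.oss-cn-beijing.aliyuncs.com/resource/qwen.png"}},
        {"type": "text", "text": "What is the text in the illustrate?"},
    ]},
]

# max_tokens is an arbitrary illustrative choice.
outputs = llm.chat(messages, SamplingParams(max_tokens=64))
print(outputs[0].outputs[0].text)
```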
Does this PR introduce any user-facing change?
No.
How was this patch tested?
Launch online service:
```bash
vllm serve Qwen/Qwen2.5-VL-7B-Instruct \
  --dtype bfloat16 \
  --max_model_len 32768 \
  --max-num-batched-tokens 32768
```
Query the server:
```bash
curl http://localhost:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "Qwen/Qwen2.5-VL-7B-Instruct",
    "messages": [
      {"role": "system", "content": "You are a helpful assistant."},
      {"role": "user", "content": [
        {"type": "image_url", "image_url": {"url": "https://modelscope.oss-cn-beijing.aliyuncs.com/resource/qwen.png"}},
        {"type": "text", "text": "What is the text in the illustrate?"}
      ]}
    ]
  }'
```
Result:
```bash
{"id":"chatcmpl-619e70733ed148b3be3a0b6524ee0ef3","object":"chat.completion","created":1748226332,"model":"/home/sss/.cache/modelscope/hub/models/Qwen/Qwen2___5-VL-7B-Instruct","choices":[{"index":0,"message":{"role":"assistant","reasoning_content":null,"content":"The text in the illustration reads \"TONGYI Qwen.\"","tool_calls":[]},"logprobs":null,"finish_reason":"stop","stop_reason":null}],"usage":{"pro
```
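The same request can also be issued programmatically. Below is a minimal sketch using the OpenAI-compatible Python client against the server launched above; it is not part of the PR's test procedure and assumes the `openai` package is installed, the server is listening on localhost:8000, and no API key is configured (the placeholder key is arbitrary).

```python
# Minimal sketch: query the `vllm serve` endpoint started above with the
# OpenAI-compatible Python client. Assumes `pip install openai` and that the
# server is running on localhost:8000; vLLM does not require an API key by
# default, so "EMPTY" is just a placeholder.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

response = client.chat.completions.create(
    model="Qwen/Qwen2.5-VL-7B-Instruct",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": [
            {"type": "image_url",
             "image_url": {"url": "https://modelscope.oss-cn-beijing.aliyuncs.com/resource/qwen.png"}},
            {"type": "text", "text": "What is the text in the illustrate?"},
        ]},
    ],
)
print(response.choices[0].message.content)
```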