Fix the accuracy issues caused by the mrope operator #2355

zhihaofang1017 · 2025-08-13T13:04:49Z

What this PR does / why we need it?

Fix precision and inference length issues with MRoPE operator in multi-image long sequences for Qwen2.5-VL-7B

This PR addresses two critical issues observed with the MRoPE operator:

Precision degradation in V1 model outputs
Inference length limitations (only generating a few characters) when processing multi-image long sequences

Root cause analysis showed that most CANN operators internally call contiguous() on their inputs so that computation reads contiguous data. The "positions" tensor, however, was passed without this step, so the operator accessed the wrong memory and computed with corrupted values.

The fix adds a contiguous() operation on the positions tensor at the Python level, ensuring proper memory layout consistency with other tensor operations.
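The failure mode and the fix can be illustrated with a minimal, dependency-free sketch. Everything here is hypothetical: `StridedView`, `naive_kernel_read`, and the 3 x 8 buffer sliced to 3 x 5 are stand-ins chosen for illustration, not the actual vllm-ascend code or the npu_mrope kernel. The assumption is that positions is a sliced view of a larger buffer, which is one common way a tensor becomes non-contiguous. In PyTorch, the equivalent of the `contiguous()` method below is `Tensor.contiguous()`.

```python
from dataclasses import dataclass


@dataclass
class StridedView:
    """Toy 2-D tensor view: flat storage plus a shape and per-dimension strides."""
    storage: list
    shape: tuple
    strides: tuple

    def to_rows(self):
        """Stride-aware read: returns the logically correct values."""
        rows, cols = self.shape
        s0, s1 = self.strides
        return [[self.storage[r * s0 + c * s1] for c in range(cols)]
                for r in range(rows)]

    def is_contiguous(self):
        """True when rows are densely packed in row-major order."""
        return self.strides == (self.shape[1], 1)

    def contiguous(self):
        """Copy into fresh, densely packed storage if needed."""
        if self.is_contiguous():
            return self
        flat = [v for row in self.to_rows() for v in row]
        return StridedView(flat, self.shape, (self.shape[1], 1))


def naive_kernel_read(view):
    """Stand-in for a kernel that ignores strides and assumes a dense layout."""
    rows, cols = view.shape
    return [view.storage[r * cols:(r + 1) * cols] for r in range(rows)]


# Hypothetical buffer: 3 position rows preallocated for 8 tokens, 5 in use.
buffer = list(range(24))
positions = StridedView(buffer, shape=(3, 5), strides=(8, 1))

expected = positions.to_rows()                        # correct values
wrong = naive_kernel_read(positions)                  # rows 1 and 2 are misread
fixed = naive_kernel_read(positions.contiguous())     # matches expected
```

In this sketch the stride-ignorant read returns the right first row but pulls rows 1 and 2 from the wrong offsets, while the same read on the contiguous copy returns the expected values.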

Does this PR introduce any user-facing change?

No.

How was this patch tested?

vLLM version: v0.9.1
vLLM main: vllm-project/vllm@83fc60b

Accuracy was compared between V0 and V1; the error between them meets the standard.

gemini-code-assist

Code Review

This pull request addresses a critical accuracy and stability issue with the MRoPE operator on Ascend NPUs. The root cause was correctly identified as a non-contiguous positions tensor being passed to the npu_mrope kernel, leading to incorrect memory access. The fix, which involves adding a .contiguous() call to the positions tensor before the kernel invocation, is direct, correct, and consistent with how other tensor arguments are handled in the same function call. This change effectively resolves the reported problem.

Signed-off-by: zhihaofang1017 <[email protected]>

gemini-code-assist bot reviewed Aug 13, 2025


github-actions bot added the module:ops label Aug 13, 2025

Fix the accuracy issues caused by the mrope operator

792a4fc

Signed-off-by: zhihaofang1017 <[email protected]>

zhihaofang1017 force-pushed the v0.9.1-dev branch from c5fcb60 to 792a4fc Compare August 13, 2025 13:18

zhihaofang1017 added 2 commits August 14, 2025 14:12

Fix the accuracy issues caused by the mrope operator 2

82d5812

Signed-off-by: zhihaofang1017 <[email protected]>

Fix the accuracy issues caused by the mrope operator 3

6c03eb9

Signed-off-by: zhihaofang1017 <[email protected]>

zhihaofang1017 closed this by deleting the head repository Aug 15, 2025
