Skip to content

Commit dd55736

Browse files
Wangbei25Wangbei25
andauthored
fix uncompatible between fc1 and non-sp-padding (vllm-project#7643)
cherry pick vllm-project#7614 ### What this PR does / why we need it? fix uncompatible between fc1 and non-sp-padding After PR [non-sp-padding](vllm-project#7297), kimi2.5 open flashcomm1 will raise an error : The expanded size of the tensor do not match the existing size at non-singleton dimension 0. ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? - vLLM version: v0.18.0 - vLLM-Ascend main: 9976e68 Signed-off-by: Wangbei25 <wangbei41@huawie.com> Co-authored-by: Wangbei25 <wangbei41@huawie.com>
1 parent 2ad0ca5 commit dd55736

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

vllm_ascend/worker/model_runner_v1.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1976,7 +1976,7 @@ def dispatch_cudagraph(num_tokens, disable_full=False, valid_modes=None):
19761976
_, num_tokens_across_dp, synced_cudagraph_mode = self._sync_batch_across_dp(
19771977
num_tokens_padded=num_tokens_padded,
19781978
cudagraph_mode=cudagraph_mode.value,
1979-
allow_dp_padding=cudagraph_mode != CUDAGraphMode.NONE,
1979+
allow_dp_padding=(cudagraph_mode != CUDAGraphMode.NONE) or enable_sp(self.vllm_config),
19801980
)
19811981

19821982
# Extract DP padding if there is any

0 commit comments

Comments
 (0)