[Bugfix] Fix the error "cur batch_size is invalid" during profile_run in the torchair scenario (#3243)

WithHades · web-flow · commit 373f84a19332 · 2025-09-29T11:51:07.000+08:00
### What this PR does / why we need it?

Fix the error "cur batch_size is invalid" during profile_run in the torchair scenario.

### Does this PR introduce _any_ user-facing change?

### How was this patch tested?

- vLLM version: v0.10.2
- vLLM main: vllm-project/vllm@releases/v0.11.0

Signed-off-by: WithHades <244036962@qq.com>
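
The following is a toy model of the mismatch this patch addresses, not the actual vllm-ascend code. It assumes (this is not stated in the PR) that a dummy run in decode mode validates the token count against a set of captured graph batch sizes, while a prefill-mode dummy run skips that check. `ToyRunner`, `graph_batch_sizes`, `select_comm_method`, and `dummy_run` are hypothetical names used only for illustration.

```python
class ToyRunner:
    """Hypothetical stand-in for a model runner; not the vllm-ascend class."""

    def __init__(self, graph_batch_sizes, mc2_tokens_capacity):
        # Assumed: batch sizes for which a decode graph has been captured.
        self.graph_batch_sizes = sorted(graph_batch_sizes)
        self.mc2_tokens_capacity = mc2_tokens_capacity

    def select_comm_method(self, num_tokens, with_prefill):
        # Assumed rule: MC2 is chosen whenever the token count fits the capacity.
        # with_prefill is accepted only to mirror the call shape in the diff.
        return "MC2" if num_tokens <= self.mc2_tokens_capacity else "ALLGATHER"

    def dummy_run(self, num_tokens, with_prefill=False):
        # Assumed: only decode-mode runs are checked against the graph sizes.
        if not with_prefill and num_tokens > self.graph_batch_sizes[-1]:
            raise ValueError("cur batch_size is invalid")
        return "prefill" if with_prefill else "decode"


runner = ToyRunner(graph_batch_sizes=[1, 2, 4, 8], mc2_tokens_capacity=512)

# The selection is made with with_prefill=True ...
method = runner.select_comm_method(runner.mc2_tokens_capacity, with_prefill=True)

# ... so the dummy run should use the same flag; omitting it takes the decode path.
try:
    runner.dummy_run(runner.mc2_tokens_capacity)
    old_call_failed = False
except ValueError:
    old_call_failed = True

new_call_mode = runner.dummy_run(runner.mc2_tokens_capacity, with_prefill=True)
```

Under these assumptions, the old call fails because it runs in decode mode with a prefill-sized token count, and passing `with_prefill=True` keeps the dummy run consistent with the flag used to select the communication method.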
diff --git a/vllm_ascend/worker/model_runner_v1.py b/vllm_ascend/worker/model_runner_v1.py
@@ -2513,7 +2513,7 @@ def profile_run(self) -> None:
             if self._select_moe_comm_method(
                     self.mc2_tokens_capacity,
                     with_prefill=True) == MoECommType.MC2:
-                self._dummy_run(self.mc2_tokens_capacity)
+                self._dummy_run(self.mc2_tokens_capacity, with_prefill=True)
 
         output = None
         if get_pp_group().is_last_rank: