Commit 373f84a

[Bugfix] Fix the error "cur batch_size is invalid" during profile_run in the torchair scenario (#3243)
### What this PR does / why we need it?

Fix the error "cur batch_size is invalid" during profile_run in the torchair scenario.

### Does this PR introduce _any_ user-facing change?

### How was this patch tested?

- vLLM version: v0.10.2
- vLLM main: vllm-project/vllm@releases/v0.11.0

Signed-off-by: WithHades <[email protected]>
1 parent 8870966 commit 373f84a

1 file changed: +1 -1 lines changed

vllm_ascend/worker/model_runner_v1.py

Lines changed: 1 addition & 1 deletion
@@ -2513,7 +2513,7 @@ def profile_run(self) -> None:
         if self._select_moe_comm_method(
                 self.mc2_tokens_capacity,
                 with_prefill=True) == MoECommType.MC2:
-            self._dummy_run(self.mc2_tokens_capacity)
+            self._dummy_run(self.mc2_tokens_capacity, with_prefill=True)
 
         output = None
         if get_pp_group().is_last_rank:
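
One plausible reading of the change, sketched with a hypothetical toy runner rather than the real vllm-ascend code: the communication method is selected with `with_prefill=True`, so the dummy run should be told the same thing. The sketch assumes (this is not confirmed by the commit) that a dummy run without `with_prefill` takes a decode-style path that validates the token count against a fixed set of graph batch sizes. `ToyRunner`, `graph_batch_sizes`, and the error wording are illustrative stand-ins.

```python
class ToyRunner:
    """Hypothetical stand-in for a model runner; not the vllm-ascend class."""

    def __init__(self, graph_batch_sizes, mc2_tokens_capacity):
        # Assumed: decode-style runs only accept these pre-captured sizes.
        self.graph_batch_sizes = sorted(graph_batch_sizes)
        self.mc2_tokens_capacity = mc2_tokens_capacity

    def _dummy_run(self, num_tokens, with_prefill=False):
        if with_prefill:
            # Assumed: the prefill path skips the graph batch-size check.
            return "prefill"
        # Assumed: the decode path rejects sizes above the largest graph size.
        if num_tokens > self.graph_batch_sizes[-1]:
            raise ValueError(f"cur batch_size is invalid: {num_tokens}")
        return "decode"


runner = ToyRunner(graph_batch_sizes=[1, 2, 4, 8], mc2_tokens_capacity=512)

# Before the fix: the flag is omitted, so the toy decode path rejects 512.
try:
    runner._dummy_run(runner.mc2_tokens_capacity)
    failed_without_flag = False
except ValueError:
    failed_without_flag = True

# After the fix: the flag matches the one used when selecting the comm method.
mode_with_flag = runner._dummy_run(runner.mc2_tokens_capacity, with_prefill=True)
```

In this toy model the mismatch only shows up when the capacity exceeds the largest captured size; whether that is the exact trigger in the torchair scenario is an assumption.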
