Skip to content

Commit 3fb686f

Browse files
committed
fix flashifner enable disable calculation
Signed-off-by: XIn Li <[email protected]>
1 parent 005ae9b commit 3fb686f

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

vllm/compilation/collective_fusion.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -176,7 +176,7 @@ def call_trtllm_fused_allreduce_norm(
176176
use_flashinfer = allreduce_in.shape[0] * allreduce_in.shape[
177177
1] * allreduce_in.element_size() <= min(
178178
_FI_MAX_SIZES[world_size],
179-
max_token_num * allreduce_in.shape[0] *
179+
max_token_num * allreduce_in.shape[1] *
180180
allreduce_in.element_size(),
181181
)
182182
if use_flashinfer:

0 commit comments

Comments
 (0)