Commit 7a7123f

KingsleyZhang123 authored and Aleksandr Malyshev committed
[BugFix][AMD][Deepseek] fix a dtype mismatch error for deepseek running on AMD (vllm-project#23864)
Signed-off-by: Jinghui Zhang <[email protected]>
1 parent 78aa33e commit 7a7123f

1 file changed, +4 -4 lines changed

vllm/model_executor/layers/fused_moe/rocm_aiter_fused_moe.py

Lines changed: 4 additions & 4 deletions
@@ -279,7 +279,7 @@ def rocm_aiter_grouped_topk(
     if e_score_correction_bias is not None:
         torch.ops.vllm.rocm_aiter_biased_grouped_topk(
             gating_output,
-            e_score_correction_bias,
+            e_score_correction_bias.to(gating_output.dtype),
             topk_weights,
             topk_ids,
             num_expert_group,
@@ -409,15 +409,15 @@ def shuffle_weights(
     *tensors: torch.Tensor, layout: tuple[int, int] = (16, 16)
 ) -> tuple[torch.Tensor, ...]:
     """
-    Applies shuffle_weight function from AITER to each 
+    Applies shuffle_weight function from AITER to each
     input tensor and returns them.
-    
+
     Rearranges (shuffles) the input tensor/s
     into a specified block layout for optimized computation.
 
     Args:
         *tensors: Variable number of torch.Tensor objects.
-        layout: A pair of integers specifying the 
+        layout: A pair of integers specifying the
         block sizes used to divide the tensors during shuffling.
         Default is (16, 16).
 
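The first hunk casts `e_score_correction_bias` to the dtype of `gating_output` before the AITER op is called. The sketch below illustrates that pattern on CPU. It assumes the op rejects inputs whose dtypes differ, which is what the commit title suggests but the diff does not show. `strict_biased_scores` is a hypothetical stand-in written for this example, not the real `rocm_aiter_biased_grouped_topk`, and the shapes and dtypes are made up.

```python
import torch


def strict_biased_scores(gating_output: torch.Tensor,
                         bias: torch.Tensor) -> torch.Tensor:
    """Hypothetical stand-in for a kernel that requires matching dtypes."""
    if gating_output.dtype != bias.dtype:
        raise TypeError(
            f"dtype mismatch: {gating_output.dtype} vs {bias.dtype}")
    return gating_output + bias


# Assumed setup: bf16 router logits and a bias stored in fp32.
gating_output = torch.randn(2, 8).to(torch.bfloat16)
e_score_correction_bias = torch.zeros(8, dtype=torch.float32)

# Passing the bias unchanged trips the dtype check.
try:
    strict_biased_scores(gating_output, e_score_correction_bias)
    mismatch_raised = False
except TypeError:
    mismatch_raised = True

# The pattern from the commit: cast the bias at the call site.
scores = strict_biased_scores(
    gating_output,
    e_score_correction_bias.to(gating_output.dtype),
)
```

When the bias already has the target dtype, `Tensor.to` returns the same tensor without copying, so the cast adds no work in the matching case.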
