Skip to content

Commit 930a241

Browse files
authored
[Bug] R1 Accuracy: Fix routed_scaling_factor Double Mul Issue (vllm-project#24119)
Signed-off-by: yewentao256 <[email protected]>
1 parent 457e471 commit 930a241

File tree

1 file changed

+2
-1
lines changed

1 file changed

+2
-1
lines changed

vllm/model_executor/models/deepseek_v2.py

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -160,7 +160,8 @@ def __init__(
160160
topk_group=config.topk_group,
161161
prefix=f"{prefix}.experts",
162162
scoring_func=config.scoring_func,
163-
routed_scaling_factor=self.routed_scaling_factor,
163+
# we do scaling outside, set factor to 1.0 to avoid double mul
164+
routed_scaling_factor=1.0,
164165
e_score_correction_bias=self.gate.e_score_correction_bias,
165166
enable_eplb=self.enable_eplb,
166167
num_redundant_experts=self.n_redundant_experts)

0 commit comments

Comments
 (0)