Skip to content

Commit 1e55dfa

Browse files
authored
[BUGFIX] KeyError 'layers.14.mlp.gate.g_idx' for Qwen3-MoE with GPTQ on ROCm (#22017)
1 parent 384a052 commit 1e55dfa

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

vllm/model_executor/models/qwen3_moe.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -149,7 +149,7 @@ def __init__(
149149
self.gate = ReplicatedLinear(config.hidden_size,
150150
config.num_experts,
151151
bias=False,
152-
quant_config=None,
152+
quant_config=quant_config,
153153
prefix=f"{prefix}.gate")
154154

155155
def forward(self, hidden_states: torch.Tensor) -> torch.Tensor:

0 commit comments

Comments
 (0)