Skip to content

Commit 33ee436

Browse files
JartXjingyu
authored andcommitted
[BUGFIX] KeyError 'layers.14.mlp.gate.g_idx' for Qwen3-MoE with GPTQ on ROCm (vllm-project#22017)
Signed-off-by: jingyu <[email protected]>
1 parent a739c9c commit 33ee436

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

vllm/model_executor/models/qwen3_moe.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -149,7 +149,7 @@ def __init__(
149149
self.gate = ReplicatedLinear(config.hidden_size,
150150
config.num_experts,
151151
bias=False,
152-
quant_config=None,
152+
quant_config=quant_config,
153153
prefix=f"{prefix}.gate")
154154

155155
def forward(self, hidden_states: torch.Tensor) -> torch.Tensor:

0 commit comments

Comments
 (0)