Skip to content

Commit 7d701cd

Browse files
committed
fix qwen3_moe normalize_qkv
Signed-off-by: taoyuxiang <[email protected]>
1 parent 7a88f24 commit 7d701cd

File tree

1 file changed

+3
-3
lines changed

1 file changed

+3
-3
lines changed

vllm_ascend/models/qwen3_moe.py

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -226,9 +226,9 @@ def forward(
226226
hidden_states: torch.Tensor,
227227
kv_cache: Optional[torch.Tensor] = None,
228228
attn_metadata: Optional[AttentionMetadata] = None) -> torch.Tensor:
229-
q, k, v = self.normalize_qkv(self.qkv_proj(hidden_states), self.q_size,
230-
self.kv_size, self.head_dim,
231-
self.rms_norm_eps)
229+
qkv, _ = self.qkv_proj(hidden_states)
230+
q, k, v = self.normalize_qkv(qkv, self.q_size, self.kv_size,
231+
self.head_dim, self.rms_norm_eps)
232232

233233
if (self.torchair_graph_enabled and attn_metadata is not None and
234234
attn_metadata.attn_state == AscendAttentionState.DecodeOnly):

0 commit comments

Comments
 (0)