Skip to content

Commit 48ba146

Browse files
committed
remove valid pplx check
Signed-off-by: Bill Nell <[email protected]>
1 parent a77fb2c commit 48ba146

File tree

1 file changed

+1
-6
lines changed
  • vllm/model_executor/layers/fused_moe

1 file changed

+1
-6
lines changed

vllm/model_executor/layers/fused_moe/layer.py

Lines changed: 1 addition & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -1016,14 +1016,9 @@ def naive_multicast(self, x: torch.Tensor,
10161016

10171017
return buffer
10181018

1019-
# TODO: will this be cudagraph-able? (probably not)
1020-
# This should not be necessary.
1021-
def invalid_pplx(self, hidden_states: torch.Tensor) -> bool:
1022-
return has_pplx and hidden_states.shape[0] < self.dp_size
1023-
10241019
def forward(self, hidden_states: torch.Tensor,
10251020
router_logits: torch.Tensor):
1026-
if self.use_direct_call or self.invalid_pplx(hidden_states):
1021+
if self.use_direct_call:
10271022
return self.forward_impl(hidden_states, router_logits)
10281023
else:
10291024
return torch.ops.vllm.moe_forward(hidden_states, router_logits,

0 commit comments

Comments
 (0)