Commit 7920de0

[Bugfix] Fix MRoPE dispatch on CPU (vllm-project#24712)
Signed-off-by: jiang1.li <[email protected]>
1 parent ddcec28 commit 7920de0

1 file changed: +9 -0 lines changed
vllm/model_executor/layers/rotary_embedding/mrope.py

Lines changed: 9 additions & 0 deletions
@@ -300,6 +300,15 @@ def forward_cuda(
         key = torch.cat((key_rot, key_pass), dim=-1).reshape(key_shape)
         return query, key
 
+    def forward_cpu(
+        self,
+        positions: torch.Tensor,
+        query: torch.Tensor,
+        key: Optional[torch.Tensor] = None,
+        offsets: Optional[torch.Tensor] = None,
+    ) -> tuple[torch.Tensor, Optional[torch.Tensor]]:
+        return self.forward_native(positions, query, key, offsets)
+
     @classmethod
     def get_input_positions(
         cls,
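
Note on the dispatch pattern: the commit does not explain why the override is needed. The sketch below shows one plausible reading. It assumes a base class whose CPU path falls back to the CUDA path by default, so a subclass with a GPU-only forward_cuda has to route forward_cpu to forward_native explicitly. ToyOp, ToyRopeBefore and ToyRopeAfter are hypothetical names used only for illustration. They are not vLLM classes, and the real base-class behaviour may differ.

```python
class ToyOp:
    """Hypothetical stand-in for a base class that dispatches per platform."""

    def __init__(self, platform: str) -> None:
        # Pick the implementation once, based on the platform name.
        table = {"cuda": self.forward_cuda, "cpu": self.forward_cpu}
        self._forward = table.get(platform, self.forward_native)

    def forward(self, x):
        return self._forward(x)

    def forward_native(self, x):
        raise NotImplementedError

    def forward_cuda(self, x):
        raise NotImplementedError

    def forward_cpu(self, x):
        # Assumed default (not confirmed by the commit): reuse the CUDA path.
        return self.forward_cuda(x)


class ToyRopeBefore(ToyOp):
    """Before the fix: forward_cpu is inherited, so CPU lands on the CUDA path."""

    def forward_native(self, x):
        return ("native", x)

    def forward_cuda(self, x):
        # Stands in for a GPU-only kernel that would not run on CPU.
        return ("cuda-kernel", x)


class ToyRopeAfter(ToyRopeBefore):
    """After the fix: forward_cpu is routed to the portable native path."""

    def forward_cpu(self, x):
        return self.forward_native(x)


before_cpu = ToyRopeBefore("cpu").forward(1)
after_cpu = ToyRopeAfter("cpu").forward(1)
after_cuda = ToyRopeAfter("cuda").forward(1)
```

In this toy setup only the CPU route changes. The CUDA route is untouched, which matches the additive, nine-line shape of the diff.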
