Commit a8d3481

clean
Signed-off-by: junq <[email protected]>
1 parent 902fb85 commit a8d3481

1 file changed: +1 -3 lines changed

tensorrt_llm/_torch/pyexecutor/model_engine.py

Lines changed: 1 addition & 3 deletions
@@ -1218,9 +1218,7 @@ def _prepare_tp_inputs(
                     dtype=torch.int32,
                     pin_memory=True)
                 mrope_config['mrope_position_deltas'].append(
-                    torch.tensor([mrope_position_deltas],
-                                 dtype=torch.int32).to('cuda',
-                                                       non_blocking=True))
+                    mrope_position_deltas.to('cuda', non_blocking=True))
 
         extend_requests = extend_cuda_graph_dummy_requests + extend_requests
         if not self._disable_overlap_scheduler and self.is_spec_decode:
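
The added line calls `.to('cuda', non_blocking=True)` directly on `mrope_position_deltas`, which suggests the value is already a tensor at this point, so wrapping it in `torch.tensor([...])` first is unnecessary. The following is a minimal sketch of that pattern, not the TensorRT-LLM code: the helper `to_device_async` and the sample `deltas` value are hypothetical, and the CPU fallback exists only so the sketch runs on machines without a GPU.

```python
import torch


def to_device_async(host_tensor: torch.Tensor) -> torch.Tensor:
    """Hypothetical helper: move an existing tensor to the GPU without re-wrapping it."""
    if not torch.cuda.is_available():
        # No GPU available: return the tensor unchanged so the sketch still runs.
        return host_tensor
    # Per the PyTorch documentation, non_blocking=True can only make the copy
    # asynchronous with respect to the host when the source is in pinned memory.
    if not host_tensor.is_pinned():
        host_tensor = host_tensor.pin_memory()
    return host_tensor.to('cuda', non_blocking=True)


# Assumed stand-in for mrope_position_deltas: already an int32 tensor.
deltas = torch.tensor([[3]], dtype=torch.int32)
on_device = to_device_async(deltas)
```

Because the tensor is moved as-is, its dtype and shape carry over unchanged to the device copy.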
