Commit dff8618

fix: pass b_mtp_index to ModelInput in warmup_overlap
1 parent 1e9b419 commit dff8618

1 file changed, +2 -0 lines changed

lightllm/common/basemodel/cuda_graph.py

Lines changed: 2 additions & 0 deletions
@@ -258,13 +258,15 @@ def warmup_overlap(self, model):
 )
 b_seq_len = torch.empty(batch_size, dtype=torch.int32, device="cuda")
 b_seq_len.fill_(seq_len)
+b_mtp_index = torch.zeros(batch_size, dtype=torch.int32, device="cuda")
 
 micro_batch = ModelInput(
     is_prefill=False,
     batch_size=batch_size,
     total_token_num=total_token_num,
     max_len_in_batch=max_len_in_batch,
     input_ids=input_ids,
+    b_mtp_index=b_mtp_index,
     mem_indexes=mem_indexes,
     b_req_idx=b_req_idx,
     b_seq_len=b_seq_len,
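
Note on the change: the two added lines give the warmup micro-batch a zero-filled `b_mtp_index` placeholder with one entry per request, matching the other per-request fields. The sketch below illustrates that pattern only. `ToyModelInput`, `build_warmup_input`, and `missing_fields` are hypothetical names, plain lists stand in for the CUDA tensors, and treating `b_mtp_index` as an optional field that defaults to `None` is an assumption, not something the diff confirms about `ModelInput`.

```python
from dataclasses import dataclass, fields
from typing import List, Optional


@dataclass
class ToyModelInput:
    """Hypothetical stand-in for ModelInput; lists replace CUDA tensors."""
    batch_size: int
    b_seq_len: List[int]
    # Assumed optional here purely for illustration.
    b_mtp_index: Optional[List[int]] = None


def build_warmup_input(batch_size: int, seq_len: int) -> ToyModelInput:
    # Mirror the diff: every per-request field gets a placeholder
    # of length batch_size, with b_mtp_index filled with zeros.
    return ToyModelInput(
        batch_size=batch_size,
        b_seq_len=[seq_len] * batch_size,
        b_mtp_index=[0] * batch_size,
    )


def missing_fields(inp: ToyModelInput) -> List[str]:
    # Names of fields that were left unset (None).
    return [f.name for f in fields(inp) if getattr(inp, f.name) is None]


warm = build_warmup_input(batch_size=4, seq_len=2)
incomplete = ToyModelInput(batch_size=4, b_seq_len=[2] * 4)

print(missing_fields(warm))        # no unset fields
print(missing_fields(incomplete))  # b_mtp_index was never supplied
```

A check like `missing_fields` could run before graph capture so that a forgotten placeholder surfaces at warmup rather than inside a captured kernel; whether lightllm needs such a guard is not stated in this commit.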
