Skip to content

Commit 8e25066

Browse files
author
wangzaijun
committed
fix
1 parent 1a7a631 commit 8e25066

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

lightllm/common/basemodel/prefill_cuda_graph.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -31,7 +31,7 @@ def __init__(self, decode_cuda_graph: CudaGraph):
3131

3232
graph_handle_token_nums = []
3333
for i in range(2048):
34-
token_num = int(2 ** i)
34+
token_num = int(2 ** (2 * i))
3535
if token_num < self.max_handle_token_num:
3636
graph_handle_token_nums.append(token_num)
3737
else:

0 commit comments

Comments
 (0)