Skip to content

Commit 506a32c

Browse files
author
wangzaijun
committed
fix
1 parent 5c71cce commit 506a32c

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

lightllm/common/basemodel/triton_kernel/kv_cache_offload.py

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -173,9 +173,9 @@ def _offload_gpu_kv_to_cpu(
173173
def offload_gpu_kv_to_cpu(
174174
token_indexes: torch.Tensor,
175175
gpu_kv_cache: torch.Tensor,
176-
gpu_kv_cache_scale: torch.Tensor,
176+
gpu_kv_cache_scale: Optional[torch.Tensor],
177177
cpu_kv_cache: torch.Tensor,
178-
cpu_kv_cache_scale: torch.Tensor,
178+
cpu_kv_cache_scale: Optional[torch.Tensor],
179179
page_indexes: torch.Tensor,
180180
page_readies: torch.Tensor,
181181
tp_index: int,

0 commit comments

Comments
 (0)