Commit 5ad0187

Add cudnn to the BatchPrefillWithPagedKVCacheWrapper
1 parent 90d00f0 commit 5ad0187

3 files changed, +210 -152 lines changed

benchmarks/routines/attention.py

Lines changed: 0 additions & 8 deletions
@@ -798,14 +798,6 @@ def testBatchPrefillWithPagedKVCacheWrapper(args):
         .int()
         .to(device)
     )
-    qo_indptr_cudnn = torch.cat(
-        [
-            torch.tensor([0], device=device),
-            torch.cumsum(actual_seq_lens_q_device.view(-1), dim=0)
-            * head_dim_qk
-            * num_qo_heads,
-        ]
-    ).int()
 
     # Because actual_seq_lens_kv is the same as actual_seq_lens_q, kv_indptr will become the same as qo_indptr
     kv_indptr = (
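
For readers unfamiliar with the two offset conventions, here is a minimal sketch in plain Python (not the benchmark's torch code) of what the removed `qo_indptr_cudnn` lines computed: the usual prefix-sum indptr over per-request query lengths, scaled by `head_dim_qk * num_qo_heads` so that each entry is an element offset rather than a token offset. The helper names `token_indptr` and `element_indptr` and the sample values are hypothetical and chosen only for illustration.

```python
from itertools import accumulate


def token_indptr(seq_lens):
    """Prefix sums of per-request lengths with a leading 0 (token units)."""
    return [0, *accumulate(seq_lens)]


def element_indptr(seq_lens, num_heads, head_dim):
    """Token indptr scaled to element offsets, mirroring the removed lines."""
    scale = num_heads * head_dim
    return [offset * scale for offset in token_indptr(seq_lens)]


# Hypothetical values, for illustration only.
seq_lens = [3, 5, 2]
num_qo_heads, head_dim_qk = 4, 8

print(token_indptr(seq_lens))                               # [0, 3, 8, 10]
print(element_indptr(seq_lens, num_qo_heads, head_dim_qk))  # [0, 96, 256, 320]
```

Under the assumption stated in the diff's own comment (the KV lengths equal the query lengths), running `token_indptr` on the KV lengths returns the same list, which is why `kv_indptr` matches `qo_indptr` there.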
