Commit 6c24fa8

Ran into an obscure compilation error, in true Triton fashion. Worked around it by gathering the gradients for the diagonal block-causal blocks separately from those for the sparse KV blocks; parallelize later.
1 parent 446a033 commit 6c24fa8

4 files changed: +362 -101 lines changed

