Skip to content

Commit 75aa7b4

Browse files
JohannesGaesslerggerganov
authored andcommitted
CUDA: faster FlashAttention, kernel for bs == 1
1 parent 08e69c5 commit 75aa7b4

File tree

1 file changed

+906
-451
lines changed

1 file changed

+906
-451
lines changed

0 commit comments

Comments
 (0)