CUDA: use async data loading for FlashAttention#11894
Merged
JohannesGaessler merged 3 commits intoggml-org:masterfrom Feb 17, 2025
Merged
CUDA: use async data loading for FlashAttention#11894JohannesGaessler merged 3 commits intoggml-org:masterfrom
JohannesGaessler merged 3 commits intoggml-org:masterfrom
Commits
Commits on Feb 15, 2025
Commits on Feb 17, 2025
- andauthored