CUDA: use mma PTX instructions for FlashAttention#11583
Merged
JohannesGaessler merged 6 commits intoggml-org:masterfrom Feb 2, 2025
Merged
CUDA: use mma PTX instructions for FlashAttention#11583JohannesGaessler merged 6 commits intoggml-org:masterfrom
JohannesGaessler merged 6 commits intoggml-org:masterfrom