Skip to content

CUDA: use mma PTX instructions for FlashAttention#11583

Merged
JohannesGaessler merged 6 commits intoggml-org:masterfrom
JohannesGaessler:cuda-fa-mma-5
Feb 2, 2025
Merged

CUDA: use mma PTX instructions for FlashAttention#11583
JohannesGaessler merged 6 commits intoggml-org:masterfrom
JohannesGaessler:cuda-fa-mma-5

Commits

Commits on Feb 1, 2025

Commits on Feb 2, 2025