CUDA: attention sinks for mma FlashAttention #15157

Merged
JohannesGaessler merged 1 commit into ggml-org:master from JohannesGaessler:cuda-fa-mma-sink-3
Aug 8, 2025

Commits

Commits on Aug 7, 2025