Skip to content

CUDA: use mma FA kernel for gqa > 4 on RTX 4000#15035

Merged
JohannesGaessler merged 1 commit intoggml-org:masterfrom
JohannesGaessler:cuda-fa-kernel-choice
Aug 2, 2025
Merged

CUDA: use mma FA kernel for gqa > 4 on RTX 4000#15035
JohannesGaessler merged 1 commit intoggml-org:masterfrom
JohannesGaessler:cuda-fa-kernel-choice

Commits

Commits on Aug 2, 2025