CUDA: optimize FA for GQA + large batches#12014
Merged
JohannesGaessler merged 1 commit intoggml-org:masterfrom Feb 22, 2025
Merged
CUDA: optimize FA for GQA + large batches#12014JohannesGaessler merged 1 commit intoggml-org:masterfrom
JohannesGaessler merged 1 commit intoggml-org:masterfrom