CUDA: Prefer vector flash decoding kernel for Gemma models#12738
Merged
JohannesGaessler merged 2 commits intoggml-org:masterfrom Apr 3, 2025
Merged
CUDA: Prefer vector flash decoding kernel for Gemma models#12738JohannesGaessler merged 2 commits intoggml-org:masterfrom
JohannesGaessler merged 2 commits intoggml-org:masterfrom