Skip to content

CUDA: faster tile FA (Pascal/AMD), headsize 256#15769

Merged
JohannesGaessler merged 1 commit intoggml-org:masterfrom
JohannesGaessler:cuda-fa-tile-256-7
Sep 6, 2025
Merged

CUDA: faster tile FA (Pascal/AMD), headsize 256#15769
JohannesGaessler merged 1 commit intoggml-org:masterfrom
JohannesGaessler:cuda-fa-tile-256-7

Commits

Commits on Sep 3, 2025