Skip to content

llama: Enable K-shift for quantized KV cache for cuda#760

Merged
ikawrakow merged 1 commit intomainfrom
fcp/kshift
Sep 5, 2025
Merged

llama: Enable K-shift for quantized KV cache for cuda#760
ikawrakow merged 1 commit intomainfrom
fcp/kshift

Commits

Commits on Sep 4, 2025