Releases: ochafik/llama.cpp
Releases · ochafik/llama.cpp
b4607
`ci`: use sccache on windows instead of ccache (#11545) * Use sccache on ci for windows * Detect sccache in cmake
b4606
`tool-call`: fix llama 3.x and functionary 3.2, play nice w/ pydantic…
b4604
Fix chatml fallback for unsupported builtin templates (when --jinja n…
b4600
`ci`: ccache for all github worfklows (#11516)
b4599
Tool call support (generic + native for Llama, Functionary, Hermes, M…
b4595
sync: minja (#11499)
b4588
vulkan: implement initial support for IQ2 and IQ3 quantizations (#11360) * vulkan: initial support for IQ3_S * vulkan: initial support for IQ3_XXS * vulkan: initial support for IQ2_XXS * vulkan: initial support for IQ2_XS * vulkan: optimize Q3_K by removing branches * vulkan: implement dequantize variants for coopmat2 * vulkan: initial support for IQ2_S * vulkan: vertically realign code * port failing dequant callbacks from mul_mm * Fix array length mismatches * vulkan: avoid using workgroup size before it is referenced * tests: increase timeout for Vulkan llvmpipe backend --------- Co-authored-by: Jeff Bolz <[email protected]>
b4528
`minja`: sync at https://github.com/google/minja/commit/0f5f7f2b3770e…
b4526
`common`: utils to split / join / repeat strings (from json converter…
b4519
common : add -hfd option for the draft model (#11318) * common : add -hfd option for the draft model * cont : fix env var * cont : more fixes