Skip to content

Releases: ochafik/llama.cpp

b4607

31 Jan 18:05
aa6fb13

Choose a tag to compare

`ci`: use sccache on windows instead of ccache (#11545)

* Use sccache on ci for windows

* Detect sccache in cmake

b4606

31 Jan 16:07
a83f528

Choose a tag to compare

`tool-call`: fix llama 3.x and functionary 3.2, play nice w/ pydantic…

b4604

31 Jan 09:28
5783575

Choose a tag to compare

Fix chatml fallback for unsupported builtin templates (when --jinja n…

b4600

31 Jan 00:34
553f1e4

Choose a tag to compare

`ci`: ccache for all github worfklows (#11516)

b4599

30 Jan 22:39
8b576b6

Choose a tag to compare

Tool call support (generic + native for Llama, Functionary, Hermes, M…

b4595

30 Jan 11:31
3d804de

Choose a tag to compare

sync: minja (#11499)

b4588

29 Jan 18:36
66ee4f2

Choose a tag to compare

vulkan: implement initial support for IQ2 and IQ3 quantizations (#11360)

* vulkan: initial support for IQ3_S

* vulkan: initial support for IQ3_XXS

* vulkan: initial support for IQ2_XXS

* vulkan: initial support for IQ2_XS

* vulkan: optimize Q3_K by removing branches

* vulkan: implement dequantize variants for coopmat2

* vulkan: initial support for IQ2_S

* vulkan: vertically realign code

* port failing dequant callbacks from mul_mm

* Fix array length mismatches

* vulkan: avoid using workgroup size before it is referenced

* tests: increase timeout for Vulkan llvmpipe backend

---------

Co-authored-by: Jeff Bolz <[email protected]>

b4528

22 Jan 17:09
c64d2be

Choose a tag to compare

`minja`: sync at https://github.com/google/minja/commit/0f5f7f2b3770e…

b4526

22 Jan 10:32
a94f3b2

Choose a tag to compare

`common`: utils to split / join / repeat strings (from json converter…

b4519

20 Jan 22:37
80d0d6b

Choose a tag to compare

common : add -hfd option for the draft model (#11318)

* common : add -hfd option for the draft model

* cont : fix env var

* cont : more fixes