Releases · ochafik/llama.cpp

31 Jan 18:05

aa6fb13

b4607

`ci`: use sccache on windows instead of ccache (#11545)

* Use sccache on ci for windows

* Detect sccache in cmake

Assets 23

31 Jan 16:07

github-actions

b4606

a83f528

`tool-call`: fix llama 3.x and functionary 3.2, play nice w/ pydantic…

Assets 22

31 Jan 09:28

github-actions

b4604

5783575

Fix chatml fallback for unsupported builtin templates (when --jinja n…

Assets 22

31 Jan 00:34

github-actions

b4600

553f1e4

`ci`: ccache for all github workflows (#11516)

Assets 22

30 Jan 22:39

github-actions

b4599

8b576b6

Tool call support (generic + native for Llama, Functionary, Hermes, M…

Assets 23

30 Jan 11:31

github-actions

b4595

3d804de

sync: minja (#11499)

Assets 23

29 Jan 18:36

github-actions

b4588

66ee4f2

vulkan: implement initial support for IQ2 and IQ3 quantizations (#11360)

* vulkan: initial support for IQ3_S

* vulkan: initial support for IQ3_XXS

* vulkan: initial support for IQ2_XXS

* vulkan: initial support for IQ2_XS

* vulkan: optimize Q3_K by removing branches

* vulkan: implement dequantize variants for coopmat2

* vulkan: initial support for IQ2_S

* vulkan: vertically realign code

* port failing dequant callbacks from mul_mm

* Fix array length mismatches

* vulkan: avoid using workgroup size before it is referenced

* tests: increase timeout for Vulkan llvmpipe backend

---------

Co-authored-by: Jeff Bolz

Assets 23

22 Jan 17:09

github-actions

b4528

c64d2be

`minja`: sync at https://github.com/google/minja/commit/0f5f7f2b3770e…

Assets 23

22 Jan 10:32

github-actions

b4526

a94f3b2

`common`: utils to split / join / repeat strings (from json converter…

Assets 23

20 Jan 22:37

github-actions

b4519

80d0d6b

common : add -hfd option for the draft model (#11318)

* common : add -hfd option for the draft model

* cont : fix env var

* cont : more fixes

Assets 23
