Releases · ochafik/llama.cpp

26 May 14:22

d74e94c

b5495

`server`: fix format of streamed tool call deltas (diff name, fix id …

Assets 18

26 May 14:03

github-actions

b5494

f13847c

b5494

server: fix regression on streamed non-chat completion w/ stops (#13785)

* more forgiving message diffs: partial stop words aren't erased, full stops are

* Add (slow) server test for completion + stream + stop

Assets 18

26 May 11:57

github-actions

b5493

79c137f

b5493

examples : allow extracting embeddings from decoder contexts (#13797)

ggml-ci

Assets 18

25 May 23:42

github-actions

b5488

e121edc

b5488

`server`: add `--reasoning-budget 0` to disable thinking (incl. qwen3…

Assets 18

25 May 07:21

github-actions

b5479

7cea29b

b5479

server: fix/test add_generation_prompt

Assets 18

25 May 07:18

github-actions

b5478

f5cd27b

b5478

`server`: streaming of tool calls and thoughts when `--jinja` is on (…

Assets 18

24 May 08:30

github-actions

b5470

b775345

b5470

ci : enable winget package updates (#13734)

Assets 18

23 May 09:53

github-actions

b5465

faaaff5

b5465

CANN: Support MUL_MAT_ID for q8_0 and q4_0 (#13705)

* [CANN]Support MUL_MAT_ID Q8 && Q4

Signed-off-by: noemotiovon <[email protected]>

* codestyle adjustment

Signed-off-by: noemotiovon <[email protected]>

---------

Signed-off-by: noemotiovon <[email protected]>

Assets 18

16 May 22:36

github-actions

b5410

3e0be1c

b5410

llguidance : official v0.7.20 release (no actual changes) [noci] (#13…

Assets 19

15 May 22:49

github-actions

b5401

bc098c3

b5401

minja: sync (qwen3) (#13573)

* minja: sync https://github.com/google/minja/commit/f06140fa52fd140fe38e531ec373d8dc9c86aa06

- https://github.com/google/minja/pull/67 (@grf53)
- https://github.com/google/minja/pull/66 (@taha-yassine)
- https://github.com/google/minja/pull/63 (@grf53)
- https://github.com/google/minja/pull/58

---------

Co-authored-by: ochafik <[email protected]>

Assets 20

Releases: ochafik/llama.cpp

b5495

Uh oh!

b5494

Uh oh!

b5493

Uh oh!

b5488

Uh oh!

b5479

Uh oh!

b5478

Uh oh!

b5470

Uh oh!

b5465

Uh oh!

b5410

Uh oh!

b5401

Uh oh!