Skip to content

Releases: EAddario/llama.cpp

b5497

26 May 15:46
03f582a

Choose a tag to compare

server: fix streaming crashes (#13786)

* add preludes to content on partial regex match

* allow all parsers to parse non-tool-call content.

* tweak order of <|python_tag|> vs <function= parsing for functionary v3.1 format. still not ideal but hopefully less prone to crash

b5478

25 May 07:14
f5cd27b

Choose a tag to compare

`server`: streaming of tool calls and thoughts when `--jinja` is on (…

b5476

24 May 22:21
17fc817

Choose a tag to compare

releases : enable openmp in windows cpu backend build (#13756)

b5373

14 May 08:06
be1d4a1

Choose a tag to compare

scripts : fix compare-llama-bench.py show parameter (#13514)

b5343

11 May 08:45
62d4250

Choose a tag to compare

docs : Fix typo in InternVL3 model name (#13440)

b5269

03 May 07:07
1d36b36

Choose a tag to compare

llama : move end-user examples to tools directory (#13249)

* llama : move end-user examples to tools directory

---------

Co-authored-by: Xuan Son Nguyen <[email protected]>

b5215

29 Apr 07:15
5f5e39e

Choose a tag to compare

model : Nomic Embed Text V2 with Mixture-of-Experts (MoE) architectur…

b5200

27 Apr 22:56
c0a97b7

Choose a tag to compare

llama-bench : Add `--override-tensors` arg (#12922)

* Add --override-tensors option to llama-bench

* Correct llama-bench --override-tensors to --override-tensor

* llama-bench: Update --override-tensors parsing to match --tensor-split, appear in test matrix.

* Make new llama-bench util functions static to fix Ubuntu CI

* llama-bench: Correct -ot corner cases (No -ot calls, leading and trailing empty -ot spans, etc.)

b5191

25 Apr 21:46
295354e

Choose a tag to compare

llama : fix K-shift with quantized K and BLAS backend (#13113)

b5156

19 Apr 11:34
37b9f0d

Choose a tag to compare

clip : refactor, add `image_manipulation` and `llava_uhd` classes (#1…