Skip to content

Releases: s-Nick/llama.cpp

b5587

04 Jun 13:36
4825487

Choose a tag to compare

releases : use dl backend for linux release, remove arm64 linux relea…

b5518

28 May 11:32
26b79b6

Choose a tag to compare

convert : fix tensor naming conflict for llama 4 vision (#13836)

* convert : fix tensor naming conflict for llama 4 vision

* add comment

b5494

26 May 14:23
f13847c

Choose a tag to compare

server: fix regression on streamed non-chat completion w/ stops (#13785)

* more forgiving message diffs: partial stop words aren't erased, full stops are

* Add (slow) server test for completion + stream + stop

b5466

23 May 14:22
9ecf3e6

Choose a tag to compare

server : support audio input (#13714)

* server : support audio input

* add audio support on webui

b5435

20 May 15:04
a4090d1

Choose a tag to compare

llama : remove llama_kv_cache_view API + remove deprecated (#13653)

ggml-ci

b5401

16 May 08:54
bc098c3

Choose a tag to compare

minja: sync (qwen3) (#13573)

* minja: sync https://github.com/google/minja/commit/f06140fa52fd140fe38e531ec373d8dc9c86aa06

- https://github.com/google/minja/pull/67 (@grf53)
- https://github.com/google/minja/pull/66 (@taha-yassine)
- https://github.com/google/minja/pull/63 (@grf53)
- https://github.com/google/minja/pull/58

---------

Co-authored-by: ochafik <[email protected]>

b5353

12 May 11:23
95e1888

Choose a tag to compare

CUDA: fix misaligned synchronization in FA (#13469)

b5318

09 May 07:44
15e0328

Choose a tag to compare

ci : limit write permission to only the release step + fixes (#13392)

* ci : limit write permission to only the release step

* fix win cuda file name

* fix license file copy on multi-config generators

b5283

05 May 12:17
5215b91

Choose a tag to compare

clip :  fix confused naming ffn_up and ffn_down (#13290)

* clip :  fix confused naming ffn_up and ffn_down

* rm ffn_i/o/g naming

* rename n_embd, n_ff

* small fix

* no check n_ff

b5209

28 Apr 14:23
d2b2031

Choose a tag to compare

llama : (mrope) allow using normal 1D position for text token (#13138)

* llama : (mrope) use normal position for text token

* rm n_pos_per_embd from llm_graph_input_attn_temp