Releases · s-Nick/llama.cpp

04 Jun 13:36

4825487

b5587

releases : use dl backend for linux release, remove arm64 linux relea…

Assets 17

28 May 11:32

github-actions

b5518

26b79b6

b5518

convert : fix tensor naming conflict for llama 4 vision (#13836)

* convert : fix tensor naming conflict for llama 4 vision

* add comment

Assets 18

26 May 14:23

github-actions

b5494

f13847c

b5494

server: fix regression on streamed non-chat completion w/ stops (#13785)

* more forgiving message diffs: partial stop words aren't erased, full stops are

* Add (slow) server test for completion + stream + stop

Assets 18

23 May 14:22

github-actions

b5466

9ecf3e6

b5466

server : support audio input (#13714)

* server : support audio input

* add audio support on webui

Assets 18

20 May 15:04

github-actions

b5435

a4090d1

b5435

llama : remove llama_kv_cache_view API + remove deprecated (#13653)

ggml-ci

Assets 20

16 May 08:54

github-actions

b5401

bc098c3

b5401

minja: sync (qwen3) (#13573)

* minja: sync https://github.com/google/minja/commit/f06140fa52fd140fe38e531ec373d8dc9c86aa06

- https://github.com/google/minja/pull/67 (@grf53)
- https://github.com/google/minja/pull/66 (@taha-yassine)
- https://github.com/google/minja/pull/63 (@grf53)
- https://github.com/google/minja/pull/58

---------

Co-authored-by: ochafik <[email protected]>

Assets 20

12 May 11:23

github-actions

b5353

95e1888

b5353

CUDA: fix misaligned synchronization in FA (#13469)

Assets 20

09 May 07:44

github-actions

b5318

15e0328

b5318

ci : limit write permission to only the release step + fixes (#13392)

* ci : limit write permission to only the release step

* fix win cuda file name

* fix license file copy on multi-config generators

Assets 20

05 May 12:17

github-actions

b5283

5215b91

b5283

clip :  fix confused naming ffn_up and ffn_down (#13290)

* clip :  fix confused naming ffn_up and ffn_down

* rm ffn_i/o/g naming

* rename n_embd, n_ff

* small fix

* no check n_ff

Assets 21

28 Apr 14:23

github-actions

b5209

d2b2031

b5209

llama : (mrope) allow using normal 1D position for text token (#13138)

* llama : (mrope) use normal position for text token

* rm n_pos_per_embd from llm_graph_input_attn_temp

Assets 26

Releases: s-Nick/llama.cpp

b5587

Uh oh!

b5518

Uh oh!

b5494

Uh oh!

b5466

Uh oh!

b5435

Uh oh!

b5401

Uh oh!

b5353

Uh oh!

b5318

Uh oh!

b5283

Uh oh!

b5209

Uh oh!