Releases · AD2605/llama.cpp
b5503
sampling : make sure samplers return at least 1 token (#13822)
* sampling : min-p should always return at least one token
* sampling : same for typical sampling
* tests : sampling tests use min_keep == 0
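For context, the guarantee described above can be pictured as a small sketch: filter candidates by a fraction of the top probability, but clamp the kept count to at least one even when `min_keep == 0`. The struct and function names below are illustrative, not the llama.cpp sampler API.

```cpp
// Minimal sketch of the min-p guarantee: drop candidates whose probability
// falls below min_p * max_prob, but always keep at least one token.
// Names (TokenProb, min_p_filter) are illustrative, not the llama.cpp API.
#include <algorithm>
#include <cstdio>
#include <vector>

struct TokenProb {
    int   id;
    float p;  // normalized probability
};

static void min_p_filter(std::vector<TokenProb> & cands, float min_p, size_t min_keep) {
    if (cands.empty()) return;

    // sort descending by probability so the best token is first
    std::sort(cands.begin(), cands.end(),
              [](const TokenProb & a, const TokenProb & b) { return a.p > b.p; });

    const float threshold = min_p * cands.front().p;

    size_t keep = 0;
    while (keep < cands.size() && cands[keep].p >= threshold) {
        keep++;
    }

    // the point of #13822: never return an empty set, even if min_keep == 0
    keep = std::max(keep, std::max<size_t>(min_keep, 1));
    cands.resize(std::min(keep, cands.size()));
}

int main() {
    std::vector<TokenProb> cands = {{0, 0.90f}, {1, 0.06f}, {2, 0.04f}};
    min_p_filter(cands, /*min_p=*/0.5f, /*min_keep=*/0);
    printf("kept %zu token(s), top id = %d\n", cands.size(), cands[0].id);
    return 0;
}
```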
b5467
llama : allow custom list of swa_layers (#13726)
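A rough sketch of what a custom SWA (sliding-window attention) layer list could look like in practice, assuming a comma-separated spec; the actual option name and format added in #13726 may differ.

```cpp
// Illustrative sketch: parse a user-supplied spec like "0,2,5" into per-layer
// SWA flags. The parsing format is an assumption, not the #13726 interface.
#include <cstdio>
#include <sstream>
#include <string>
#include <vector>

std::vector<bool> parse_swa_layers(const std::string & spec, int n_layer) {
    std::vector<bool> is_swa(n_layer, false);
    std::stringstream ss(spec);
    std::string tok;
    while (std::getline(ss, tok, ',')) {
        const int il = std::stoi(tok);
        if (il >= 0 && il < n_layer) is_swa[il] = true;
    }
    return is_swa;
}

int main() {
    auto swa = parse_swa_layers("0,2,5", 8);
    for (int il = 0; il < 8; il++)
        printf("layer %d: %s\n", il, swa[il] ? "SWA" : "full attention");
    return 0;
}
```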
b5432
sycl: disable reorder for sycl mulmat (#13536)
b5423
mtmd : add vision support for llama 4 (#13282)
* wip llama 4 conversion
* rm redundant __init__
* fix conversion
* fix conversion
* test impl
* try this
* reshape patch_embeddings_0
* fix view
* rm ffn_post_norm
* cgraph ok
* f32 for pos embd
* add image marker tokens
* Llama4UnfoldConvolution
* correct pixel shuffle
* fix merge conflicts
* correct
* add debug_graph
* logits matched, but it still perceives the image incorrectly
* fix style
* add image_grid_pinpoints
* handle llama 4 preprocessing
* rm load_image_size
* rm unused line
* fix
* small fix 2
* add test & docs
* fix llava-1.6 test
* test: add notion of huge models
* add comment
* add warn about degraded quality
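One step worth illustrating from the list above is the pixel shuffle, which trades spatial resolution for channel depth when merging vision patches. The layout and shuffle factor below are assumptions for illustration, not the mtmd implementation.

```cpp
// Minimal sketch of a pixel-shuffle step for vision patches: an h x w grid of
// patch embeddings (dim d) is folded into an (h/r) x (w/r) grid of dim r*r*d,
// concatenating each r x r neighborhood. Row-major layout is assumed.
#include <cstdio>
#include <vector>

std::vector<float> pixel_shuffle(const std::vector<float> & in, int h, int w, int d, int r) {
    // in: h*w*d values, laid out [y][x][c]; r: shuffle factor (e.g. 2)
    std::vector<float> out(in.size());
    const int ho = h / r, wo = w / r;
    for (int y = 0; y < ho; y++)
        for (int x = 0; x < wo; x++)
            for (int dy = 0; dy < r; dy++)
                for (int dx = 0; dx < r; dx++)
                    for (int c = 0; c < d; c++) {
                        const int src = ((y*r + dy) * w + (x*r + dx)) * d + c;
                        const int dst = (y * wo + x) * (r*r*d) + (dy*r + dx) * d + c;
                        out[dst] = in[src];
                    }
    return out;
}

int main() {
    const int h = 4, w = 4, d = 8;
    std::vector<float> patches(h * w * d, 1.0f);
    auto merged = pixel_shuffle(patches, h, w, d, /*r=*/2);
    printf("tokens: %d -> %d, dim: %d -> %d\n", h*w, (h/2)*(w/2), d, 4*d);
    return 0;
}
```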
b5416
CANN: Support MOE Model MUL_MAT_ID (#13042)
Signed-off-by: noemotiovon <[email protected]>
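MUL_MAT_ID is the ggml op behind MoE routing: each input row selects one expert weight matrix by id and is multiplied by it. A scalar reference sketch of that computation, with dimensions and names chosen for illustration rather than taken from ggml:

```cpp
// Illustrative scalar version of a MUL_MAT_ID-style op: per-token expert
// selection followed by a matrix-vector product with the chosen expert.
#include <cstdio>
#include <vector>

// experts: n_expert matrices of shape [n_out][n_in], flattened contiguously
void mul_mat_id(const std::vector<float> & experts,
                const std::vector<float> & x,     // n_tok * n_in inputs
                const std::vector<int>   & ids,   // expert id per token
                std::vector<float>       & y,     // n_tok * n_out outputs
                int n_in, int n_out, int n_tok) {
    for (int t = 0; t < n_tok; t++) {
        const float * W = experts.data() + (size_t) ids[t] * n_out * n_in;
        for (int o = 0; o < n_out; o++) {
            float acc = 0.0f;
            for (int i = 0; i < n_in; i++)
                acc += W[o * n_in + i] * x[t * n_in + i];
            y[t * n_out + o] = acc;
        }
    }
}

int main() {
    const int n_in = 4, n_out = 2, n_expert = 3, n_tok = 2;
    std::vector<float> experts(n_expert * n_out * n_in, 0.5f);
    std::vector<float> x(n_tok * n_in, 1.0f);
    std::vector<int>   ids = {0, 2};  // token 0 -> expert 0, token 1 -> expert 2
    std::vector<float> y(n_tok * n_out);
    mul_mat_id(experts, x, ids, y, n_in, n_out, n_tok);
    printf("y[0] = %.1f\n", y[0]);  // 4 inputs * 0.5 weight * 1.0 input = 2.0
    return 0;
}
```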
b5392
server : proper error handling for missing elements in messages array…
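The kind of validation this entry refers to can be sketched as follows, using nlohmann::json (the JSON library the llama.cpp server builds on); the specific checks and error messages here are illustrative, not the merged code.

```cpp
// Sketch: validate a chat "messages" array up front and report a clear error
// instead of crashing on missing fields. Checks are illustrative assumptions.
#include <nlohmann/json.hpp>
#include <cstdio>
#include <stdexcept>

using json = nlohmann::json;

void validate_messages(const json & body) {
    if (!body.contains("messages") || !body["messages"].is_array()) {
        throw std::invalid_argument("\"messages\" must be a JSON array");
    }
    for (const auto & msg : body["messages"]) {
        if (!msg.is_object()) {
            throw std::invalid_argument("each message must be an object");
        }
        if (!msg.contains("role") || !msg["role"].is_string()) {
            throw std::invalid_argument("message is missing a string \"role\"");
        }
        if (!msg.contains("content")) {
            throw std::invalid_argument("message is missing \"content\"");
        }
    }
}

int main() {
    try {
        validate_messages(json::parse(R"({"messages":[{"role":"user"}]})"));
    } catch (const std::exception & e) {
        // prints: 400 Bad Request: message is missing "content"
        printf("400 Bad Request: %s\n", e.what());
    }
    return 0;
}
```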
b5359
clip : cap max image size 1024 for qwen vl model (#13478)
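The cap can be sketched as a simple longest-side clamp that preserves aspect ratio; the exact rounding and patch-alignment rules in clip.cpp may differ from this illustration.

```cpp
// Illustrative sketch: scale an image so its longest side is at most 1024,
// keeping the aspect ratio, as the clip change above does for Qwen-VL inputs.
#include <algorithm>
#include <cstdio>

void cap_image_size(int & w, int & h, int max_side = 1024) {
    const int longest = std::max(w, h);
    if (longest <= max_side) return;
    const float scale = (float) max_side / (float) longest;
    w = std::max(1, (int) (w * scale));
    h = std::max(1, (int) (h * scale));
}

int main() {
    int w = 4096, h = 1536;
    cap_image_size(w, h);
    printf("scaled to %dx%d\n", w, h);  // 1024x384
    return 0;
}
```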
b5329
metal : optimize MoE for large batches (#13388)
b5316
server : (webui) fix a very small misalignment (#13387)
* server : (webui) fix a very small misalignment
* restore font-bold
b5307
docker : disable arm64 and intel images (#13356)