Releases: EAddario/llama.cpp
b5890
quantize : fix minor logic flaw in --tensor-type (#14572)
b5873
model : support LiquidAI LFM2 hybrid family (#14620)

**Important:** LFM2 was [merged](https://github.com/huggingface/transformers/pull/39340) into transformers, but has not yet been released. To convert to GGUF, install transformers from source:

```shell
pip install "transformers @ git+https://github.com/huggingface/transformers.git@main"
```
b5837
llama : remove ggml_cont where possible (#14568)
b5833
vulkan: Handle updated FA dim2/3 definition (#14518)

* vulkan: Handle updated FA dim2/3 definition. Pack the mask boolean and n_head_log2 into a single dword to keep the push constant block under the 128B limit.
* handle null mask for GQA
* allow GQA with dim3 > 1
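The packing trick described in that entry can be sketched in plain C++. The bit layout below (flag in the top bit, value in the low 31 bits) is an illustrative assumption, not the actual shader push-constant layout:

```cpp
#include <cassert>
#include <cstdint>

// Sketch: pack a boolean flag and a small integer into one 32-bit dword,
// since Vulkan only guarantees 128 bytes of push constants. Bit 31 holds
// the flag; bits 0..30 hold the value. Field widths are assumptions here.
static uint32_t pack_mask_nhead(bool has_mask, uint32_t n_head_log2) {
    assert(n_head_log2 < (1u << 31)); // value must fit in the low 31 bits
    return (static_cast<uint32_t>(has_mask) << 31) | n_head_log2;
}

static bool unpack_mask(uint32_t packed) {
    return (packed >> 31) != 0;
}

static uint32_t unpack_nhead(uint32_t packed) {
    return packed & 0x7FFFFFFFu;
}
```

On the GPU side the shader would unpack the same dword with identical shifts and masks, so both sides only need to agree on the bit layout.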
b5707
sycl: Cleanup codepaths in Get Rows in sycl backend (#14215) Addresses unused reorder path
b5672
quantize : change int to unsigned int for KV overrides (#14197)
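The kind of pitfall such an int-to-unsigned change guards against can be illustrated with a generic sketch (not the actual llama.cpp code; names are hypothetical):

```cpp
#include <cassert>
#include <cstdint>

// Generic illustration: a count or index that can never be negative is
// safer as an unsigned type, because a signed -1 silently wraps to a huge
// value when converted, and mixed signed/unsigned comparisons invite
// out-of-range accesses.
static bool index_in_range(unsigned int idx, unsigned int count) {
    return idx < count; // well-defined: both operands unsigned
}
```

The range check still rejects a wrapped negative value, but keeping the type unsigned end-to-end avoids the conversion in the first place.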
b5669
kv-cache : fix use-after-move of defrag info (#14189) ggml-ci
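The bug class behind this fix can be shown with a minimal sketch (types and names are hypothetical, not llama.cpp's actual defrag code):

```cpp
#include <cstddef>
#include <utility>
#include <vector>

// Hypothetical stand-in for the moved object.
struct defrag_info {
    std::vector<int> moves;
};

// Buggy pattern (for contrast): reading `info` after it was moved from
// leaves the result unspecified:
//     queue.push_back(std::move(info));
//     return info.moves.size(); // use-after-move
//
// Fixed pattern: capture what you need before the move.
static std::size_t apply_fixed(std::vector<defrag_info> & queue,
                               defrag_info && info) {
    const std::size_t n = info.moves.size(); // read before moving
    queue.push_back(std::move(info));
    return n;
}
```

A moved-from standard container is in a valid but unspecified state, so the buggy version may "work" in testing and still break later, which is why such fixes matter even without a visible crash.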
b5663
compare-llama-bench: add option to plot (#14169)

* compare llama-bench: add option to plot
* Address review comments: convert case + add type hints
* Add matplotlib to requirements
* fix tests
* Improve comment and fix assert condition for test
* Add back default test_name, add --plot_log_scale
* use log_scale regardless of x_values
b5649
vocab : prevent heap overflow when vocab is too small (#14145) ggml-ci
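The shape of the guard such a fix adds can be sketched as follows (a generic illustration with hypothetical names, not llama.cpp's vocab API):

```cpp
#include <cstdint>
#include <string>
#include <vector>

// Sketch: validate a token id against the actual vocab size before
// indexing, instead of trusting that the model file's vocab is large
// enough. A too-small vocab must produce an error, not a heap overflow.
static const std::string * token_text(const std::vector<std::string> & vocab,
                                      int32_t id) {
    if (id < 0 || static_cast<std::size_t>(id) >= vocab.size()) {
        return nullptr; // out of range: reject instead of reading past the end
    }
    return &vocab[static_cast<std::size_t>(id)];
}
```

Returning a sentinel (here `nullptr`) pushes the error to the caller, which is the usual defensive pattern when the index originates from untrusted file contents.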
b5530
llama : add RobertaForSequenceClassification reranker support (#13875)