Releases: unslothai/llama.cpp

b5888

13 Jul 06:26
a032396

Merge branch 'ggml-org:master' into master

b5885

13 Jul 06:19

Fixes

b5884

13 Jul 02:20
c31e606

tests : cover lfm2 cases in test_ssm_conv (#14651)

b5873

12 Jul 08:16
f5e96b3

model : support LiquidAI LFM2 hybrid family (#14620)

**Important**
LFM2 was [merged](https://github.com/huggingface/transformers/pull/39340) into transformers, but that version has not been released yet.
To convert the model to GGUF, install transformers from source:
```shell
pip install "transformers @ git+https://github.com/huggingface/transformers.git@main"
```
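With transformers installed from source, the conversion can then be run with llama.cpp's `convert_hf_to_gguf.py` script. A minimal sketch, assuming the checkpoint has been downloaded locally; the model path and output file name below are illustrative:
```shell
# Convert the Hugging Face checkpoint to GGUF with F16 weights
# (./LFM2-1.2B is a placeholder path to the downloaded model directory).
python convert_hf_to_gguf.py ./LFM2-1.2B --outfile lfm2-1.2b-f16.gguf --outtype f16
```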

b5334

09 May 22:57
0d90bbe

Merge branch 'ggml-org:master' into master

b5319

08 May 23:38
f688555

Update getrows.cu

b5318

08 May 23:24
15e0328

ci : limit write permission to only the release step + fixes (#13392)

* ci : limit write permission to only the release step

* fix win cuda file name

* fix license file copy on multi-config generators

b5287

06 May 00:06
9070365

CUDA: fix logic for clearing padding with -ngl 0 (#13320)

b5272

04 May 01:20
3e959f0

imatrix: fix oob writes if src1 is not contiguous (#13286)

b5270

03 May 08:24

Revert "CUDA: batched+noncont MMQ, refactor bs>1 MoE code (#13199)"

This reverts commit e1e8e0991ffd9e99a445c6812bb519d5bac9f4b5.