Releases: mofosyne/llama.cpp

b2898

16 May 06:24
ad52d5c

doc: add references to hugging face GGUF-my-repo quantisation web too…

b2886

15 May 05:02
9f77348

script : sync ggml-rpc

b2876

14 May 08:56
5416002

llama : disable pipeline parallelism with nkvo (#7265)

b2874

14 May 07:03
e0f5561

Add left recursion check: quit early instead of going into an infinit…

b2866

13 May 03:21
b1f8af1

convert.py: Outfile default name change and additional metadata suppo…

b2864

13 May 00:31
cbf7589

[SYCL] Add oneapi runtime dll files to win release package (#7241)

* add oneAPI runtime DLLs to release package

* fix path

* fix path

* fix path

* fix path

* fix path

---------

Co-authored-by: Zhang <[email protected]>

b2839

10 May 12:14
4e38809

Fix memory bug in grammar parser (#7194)

The llama.cpp grammar parser had a bug where omitting the closing
quotation mark of a string caused parsing to crash. Anyone running a
server on a public endpoint is advised to upgrade. To reproduce this bug:

    ./llamafile -m foo.gguf -p bar --grammar 'root::="'

Credit for discovering and reporting this issue goes to Eclypsium
Security Researcher Richard Johnson <[email protected]>.
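
The class of bug described above, an unterminated string literal that makes a parser run past the end of its input, can be illustrated with a minimal sketch. This is not the llama.cpp fix and not its parser code. The names `GrammarParseError` and `parse_string_literal` are hypothetical, and the escape handling is simplified. The point is only that the end-of-input check happens before every read.

```python
from typing import Tuple


class GrammarParseError(ValueError):
    """Raised when the grammar text is malformed (hypothetical name)."""


def parse_string_literal(src: str, pos: int) -> Tuple[str, int]:
    """Parse a double-quoted literal that starts at src[pos].

    Returns the literal's contents and the index just past the closing quote.
    """
    if pos >= len(src) or src[pos] != '"':
        raise GrammarParseError(f"expected opening quote at offset {pos}")
    pos += 1
    chars = []
    while True:
        # Check for end of input first, so an unterminated literal is
        # reported as an error instead of being scanned past the buffer.
        if pos >= len(src):
            raise GrammarParseError("unterminated string literal")
        ch = src[pos]
        if ch == '"':
            return "".join(chars), pos + 1
        if ch == "\\":
            # An escape needs one more character; verify it exists
            # before consuming it (simplified: the character is kept as-is).
            if pos + 1 >= len(src):
                raise GrammarParseError("unterminated escape sequence")
            chars.append(src[pos + 1])
            pos += 2
            continue
        chars.append(ch)
        pos += 1
```

With this sketch, `parse_string_literal('"abc" rest', 0)` returns `('abc', 5)`, while the input from the reproduction command, `parse_string_literal('root::="', 7)`, raises `GrammarParseError` rather than reading beyond the end of the text.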

b2836

10 May 05:12
8c570c9

Minor arithmetic improvement to mmvq wrapper kernel (#7172)

b2806

08 May 05:22
c780e75

Further tidy on Android instructions README.md (#7077)

* Further tidy on Android instructions README.md

Fixed some logic in the steps a user follows from the README

* Clean up redundant information

A new user arriving will see simple directions on llama.cpp homepage

* corrected punctuation

Period after cmake, colon after termux

* re-word for clarity

"method" seems more correct than "alternative" in this context

* Organized required packages per build type

Building llama.cpp with the NDK on a PC doesn't require installing clang, cmake, git, or wget in Termux.

* README.md

corrected title

* fix trailing whitespace

b2794

06 May 06:23
628b299

Adding support for the --numa argument for llama-bench. (#7080)