Releases: mofosyne/llama.cpp
b2898
doc: add references to hugging face GGUF-my-repo quantisation web too…
b2886
script : sync ggml-rpc
b2876
llama : disable pipeline parallelism with nkvo (#7265)
b2874
Add left recursion check: quit early instead of going into an infinit…
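The left-recursion check above can be illustrated with a minimal sketch. This is not llama.cpp's actual GBNF parser code; the rule representation and function name here are invented for illustration. The idea is a pre-parse walk that detects when a nonterminal can derive itself in leftmost position, so the parser can quit early with an error instead of recursing forever:

```python
# Hedged sketch of a left-recursion check, NOT llama.cpp's GBNF parser.
# rules: dict mapping a nonterminal name to a list of alternatives,
# where each alternative is a list of symbols (nonterminals or terminals).

def has_left_recursion(rules):
    """Return True if any nonterminal can derive itself as the
    leftmost symbol of an expansion (direct or indirect)."""
    def leftmost_reaches_self(start):
        seen = set()
        stack = [start]
        while stack:
            sym = stack.pop()
            for alt in rules.get(sym, []):
                if not alt:
                    continue  # empty alternative: nothing in leftmost position
                first = alt[0]
                if first == start:
                    return True  # found a leftmost cycle back to `start`
                if first in rules and first not in seen:
                    seen.add(first)
                    stack.append(first)
        return False
    return any(leftmost_reaches_self(nt) for nt in rules)
```

For example, `expr ::= expr "+" term | term` is directly left-recursive and would send a naive recursive-descent parser into an infinite loop, while the right-recursive `expr ::= term "+" expr | term` is fine.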
b2866
convert.py: Outfile default name change and additional metadata suppo…
b2864
[SYCL] Add oneapi runtime dll files to win release package (#7241)
* add oneapi runtime dlls to release package
* fix path
Co-authored-by: Zhang <[email protected]>
b2839
Fix memory bug in grammar parser (#7194)
The llama.cpp grammar parser had a bug where forgetting to add a closing
quotation mark to a string would cause parsing to crash. Anyone running a
server on a public endpoint is advised to upgrade. To reproduce this bug:
    ./llamafile -m foo.gguf -p bar --grammar 'root::="'
Credit for discovering and reporting this issue goes to Eclypsium
Security Researcher Richard Johnson <[email protected]>.
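The failure mode above amounts to a string scanner reading past the end of its buffer when the closing quote never appears. The fix is to bound the scan and report the missing quote as a parse error. Below is a minimal illustrative sketch of that pattern; it is not the actual llama.cpp grammar parser, and the function name is invented here:

```python
# Hedged sketch: scanning a double-quoted grammar literal safely,
# NOT the actual llama.cpp grammar parser code.

def scan_quoted(src, pos):
    """Scan a double-quoted literal starting at src[pos] == '"'.
    Returns (literal, next_pos). Raises ValueError on a missing
    closing quote instead of reading past the end of the buffer."""
    assert src[pos] == '"'
    i = pos + 1
    out = []
    while i < len(src):
        ch = src[i]
        if ch == '"':
            return "".join(out), i + 1  # consumed the closing quote
        if ch == "\\" and i + 1 < len(src):
            out.append(src[i + 1])  # simple escape handling
            i += 2
            continue
        out.append(ch)
        i += 1
    # End of input reached with no closing quote: fail loudly,
    # which is what a grammar like 'root::="' should trigger.
    raise ValueError("unterminated string literal in grammar")
```

With this shape, the malformed grammar `root::="` yields an error message rather than a crash.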
b2836
Minor arithmetic improvement to mmvq wrapper kernel (#7172)
b2806
Further tidy on Android instructions README.md (#7077)
* Further tidy on Android instructions README.md: fixed some logic when following readme directions
* Clean up redundant information: a new user arriving will see simple directions on the llama.cpp homepage
* Corrected punctuation: period after cmake, colon after termux
* Re-word for clarity: "method" is more correct than "alternative" in this context
* Organized required packages per build type: building llama.cpp with the NDK on a PC doesn't require installing clang, cmake, git, or wget in termux
* README.md: corrected title
* Fix trailing whitespace
b2794
Adding support for the --numa argument for llama-bench. (#7080)