
Releases: ngxson/llama.cpp

b4805

03 Mar 13:53
d5c63cd
test-backend-ops : add option -p to filter by op params (#12155)

b4804

03 Mar 13:47
9660ffe
ggml : fix kleidiai build (#12159)

The libggml API has changed, but the kleidiai code had not been updated to match.

b4803

03 Mar 13:45
c950a1f
Adding UTF-8 support to llama.cpp (#12111)

For emojis, non-alpha characters, etc.

Signed-off-by: Eric Curtin <[email protected]>

b4801

03 Mar 10:49
ece9745
SYCL: Move CPY kernels to a separate file and add few missing kernels…

b4800

02 Mar 21:51
cc473ca
ggml-backend : keep paths in native string type when possible (#12144)

b4799

02 Mar 14:38
14dec0c
main: use jinja chat template system prompt by default (#12118)

* Use jinja chat template system prompt by default

* faster conditional order

* remove nested ternary

---------

Co-authored-by: Xuan Son Nguyen <[email protected]>

b4798

01 Mar 15:06
1782cdf
main: update outdated system prompt message (followup to #12131) (#12…

b4797

01 Mar 13:43
45a8e76
common : add --system-prompt parameter, replace behavior of -p in con…

b4796

01 Mar 12:40
80c41dd
CUDA: compress mode option and default to size (#12029)

CUDA 12.8 added the option to specify stronger compression for binaries, so we now default to "size".

b4793

28 Feb 14:32
70680c4
ggml : upgrade init_tensor API to return a ggml_status (#11854)

* Upgrade init_tensor API to return a ggml_status

To prepare for an 'abort-free' ggml
(so that ggml does not abort on OOM but returns an OOM status),
as agreed with Diego in the ggml repo,
upgrade the init_tensor() and view_init() APIs
to return a ggml_status.

* misc fixes

---------

Co-authored-by: slaren <[email protected]>