Releases · ochafik/llama.cpp

13 Feb 01:06

a394039

b4702

ggml-cpu : add chunking support to mul_mat_id (#11666)

* ggml-cpu : add chunking support to mul_mat_id

* allocate chunk counter in wdata
parallelize src1 quantization by column to allow parallelization even when there is only one row

* disable for arm

* cleanup

* better way to disable for arm

* fix uninitialized counter when using 1 thread only

* revert test-backend-ops changes
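
The chunking idea in the bullets above can be illustrated with a shared atomic chunk counter from which threads claim work. This is a minimal sketch and not the ggml-cpu implementation: `chunked_sum`, its parameters, and the summing workload are hypothetical stand-ins for the real matrix multiplication, and the counter lives in a local variable rather than in `wdata`.

```cpp
#include <algorithm>
#include <atomic>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <thread>
#include <vector>

// Hypothetical helper: threads repeatedly claim the next unprocessed chunk
// from a shared atomic counter until no chunks remain.
int64_t chunked_sum(const std::vector<int> & data, int n_threads, size_t chunk_size) {
    assert(n_threads >= 1 && chunk_size > 0);
    const size_t n_chunks = (data.size() + chunk_size - 1) / chunk_size;

    // The counter is explicitly initialized, so the single-thread path
    // starts from chunk 0 as well.
    std::atomic<size_t> next_chunk{0};
    std::vector<int64_t> partial(n_threads, 0);  // one accumulator per thread

    auto worker = [&](int ith) {
        for (;;) {
            const size_t chunk = next_chunk.fetch_add(1, std::memory_order_relaxed);
            if (chunk >= n_chunks) break;  // all chunks have been claimed
            const size_t begin = chunk * chunk_size;
            const size_t end   = std::min(begin + chunk_size, data.size());
            for (size_t i = begin; i < end; ++i) partial[ith] += data[i];
        }
    };

    std::vector<std::thread> threads;
    for (int t = 1; t < n_threads; ++t) threads.emplace_back(worker, t);
    worker(0);  // the calling thread takes part in the work
    for (auto & th : threads) th.join();

    int64_t total = 0;
    for (int64_t p : partial) total += p;
    return total;
}
```

Because chunks are claimed dynamically, a thread that finishes early simply takes the next chunk instead of idling, which is the usual motivation for this pattern.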

Assets 23

12 Feb 13:04

github-actions

b4692

c3d6af7

CUDA: fix CUDART_VERSION checks (#11821)
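
The commit title does not say which checks were wrong. As background only, the CUDA runtime encodes its version as `1000 * major + 10 * minor`, so a comparison against `CUDART_VERSION` has to use that encoding. The helpers below are hypothetical and do not come from the commit; they only show the arithmetic.

```cpp
#include <cassert>

// Hypothetical helper: encodes a CUDA version the way CUDART_VERSION does,
// i.e. 1000 * major + 10 * minor.
constexpr int cuda_version_code(int major, int minor) {
    return 1000 * major + 10 * minor;
}

// Hypothetical helpers: decode the major and minor parts again.
constexpr int cuda_major(int code) { return code / 1000; }
constexpr int cuda_minor(int code) { return (code % 1000) / 10; }

// Compile-time checks of the encoding.
static_assert(cuda_version_code(11, 7) == 11070, "11.7 encodes as 11070");
static_assert(cuda_version_code(12, 0) == 12000, "12.0 encodes as 12000");
```

In real CUDA code the comparison would typically appear in a preprocessor condition such as `#if CUDART_VERSION >= 11070`.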

Assets 23

09 Feb 18:41

github-actions

b4677

19d3c82

There's a better way of clearing lines (#11756)

Use the ANSI escape code for clearing a line.
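
A minimal sketch of the technique, assuming the "Erase in Line" sequence `ESC [ 2 K` followed by a carriage return. The exact sequence used by the commit is not shown here, and the helper names are hypothetical.

```cpp
#include <cassert>
#include <string>

// ANSI "Erase in Line" with parameter 2 clears the whole current line;
// the carriage return then moves the cursor back to the first column.
inline std::string clear_line_sequence() {
    return "\033[2K\r";
}

// Hypothetical helper: returns the bytes to write in order to replace the
// current terminal line with `text`. The text must not contain a newline,
// since that would move the cursor to another line.
inline std::string rewrite_line(const std::string & text) {
    assert(text.find('\n') == std::string::npos);
    return clear_line_sequence() + text;
}
```

Writing `rewrite_line("50%")` to a terminal and flushing, then doing the same with `"100%"`, overwrites the progress text in place without padding the line with spaces.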

Signed-off-by: Eric Curtin <[email protected]>

Assets 23

08 Feb 15:16

github-actions

b4671

4d3465c

ggml: Fix data race in ggml threadpool (#11736)

After the barrier in the last iteration is executed, the loop termination
condition is still evaluated. However, the main thread may already have destroyed
the cgraph object and its nodes by then, so another thread can access memory that is already gone.
Trouble can also occur when n_nodes == 0 or abort is called, but I'm not sure whether the
former situation is possible.

The last synchronization should be done after the loop to ensure the cgraph/cplan won't be
accessed after the main thread exits from the function.
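
The ordering described above can be sketched as follows. This is an illustration under stated assumptions, not the ggml threadpool code: `Graph`, `SpinBarrier`, and `compute_graph` are hypothetical stand-ins, and the per-node work is a simple sum.

```cpp
#include <atomic>
#include <cassert>
#include <memory>
#include <thread>
#include <vector>

// Stand-in for the compute graph; not the real cgraph structure.
struct Graph {
    int n_nodes = 0;
    std::vector<int> nodes;
};

// Minimal reusable spin barrier, for illustration only.
class SpinBarrier {
public:
    explicit SpinBarrier(int n) : n_threads_(n) {}
    void wait() {
        const int gen = generation_.load();
        if (arrived_.fetch_add(1) == n_threads_ - 1) {
            arrived_.store(0);         // reset before releasing the waiters
            generation_.fetch_add(1);  // release everyone waiting on `gen`
        } else {
            while (generation_.load() == gen) std::this_thread::yield();
        }
    }
private:
    const int n_threads_;
    std::atomic<int> arrived_{0};
    std::atomic<int> generation_{0};
};

long compute_graph(int n_threads, int n_nodes) {
    assert(n_threads >= 1 && n_nodes >= 0);
    auto graph = std::make_unique<Graph>();
    graph->n_nodes = n_nodes;
    for (int i = 1; i <= n_nodes; ++i) graph->nodes.push_back(i);

    SpinBarrier barrier(n_threads);
    std::atomic<long> sum{0};

    auto worker = [&barrier, &sum, n_threads, g = graph.get()](int ith) {
        for (int node = 0; node < g->n_nodes; ++node) {  // the condition reads *g
            if (node % n_threads == ith) sum += g->nodes[node];
            if (node + 1 < g->n_nodes) barrier.wait();   // sync between nodes
        }
        barrier.wait();  // final sync after the loop; *g is not touched past here
    };

    std::vector<std::thread> threads;
    for (int t = 1; t < n_threads; ++t) threads.emplace_back(worker, t);
    worker(0);      // the main thread takes part in the work
    graph.reset();  // safe: every thread has already left the loop
    for (auto & th : threads) th.join();
    return sum.load();
}
```

If the final `barrier.wait()` sat inside the loop instead, a worker could still evaluate `node < g->n_nodes` after the main thread had destroyed the graph. Placing it after the loop also covers the `n_nodes == 0` case, where the loop body never runs.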

Assets 23

04 Feb 16:32

github-actions

b4636

db288b6

`tool-call`: command r7b fix for normal responses (#11608)

* fix command r7b normal response regex + add to server test

* test multiline non-tool-call responses in test-chat

Assets 23

04 Feb 00:21

github-actions

b4628

cde3833

`tool-call`: allow `--chat-template chatml` w/ `--jinja`, default to …

Assets 23

03 Feb 10:42

github-actions

b4622

d92cb67

server : (webui) Fix Shift+Enter handling (#11609)

* Fix Shift+Enter handling

`exact` on the Enter handler means the message is not sent when Shift+Enter is pressed anyway

* build index.html.gz

---------

Co-authored-by: Xuan Son Nguyen <[email protected]>

Assets 22

02 Feb 10:01

github-actions

b4615

bfcce4d

`tool-call`: support Command R7B (+ return tool_plan "thoughts" in AP…

Assets 23

01 Feb 12:53

github-actions

b4610

cfd74c8

`sync`: minja (https://github.com/google/minja/commit/418a2364b56dc9b…

Assets 23

01 Feb 12:03

github-actions

b4609

ecef206

Implement s3:// protocol (#11511)

For those that want to pull from s3

Signed-off-by: Eric Curtin <[email protected]>

Assets 23
