Releases: ServeurpersoCom/llama.cpp
b6689
rpc : check src buffer when copying tensor (#16421)
Only the dst buffer is guaranteed to be an RPC buffer; add a check for the src one as well.
b6688
rpc : add support for multiple devices (#16276)
Allow rpc-server to expose multiple devices from a single endpoint. Change the RPC protocol to include a device identifier where needed. Closes #15210.
* fixes
* use ggml_backend_reg_t
* address review comments
* fix llama-bench backend report
* address review comments, change device naming
* fix cmd order
b6686
chat : support Magistral thinking (#16413)
* feat: added a dedicated Magistral chat format that preserves [THINK] spans and parses reasoning before tool calls
* feat: new flow in the chat template test suite for Magistral
b6684
metal : fix loop bound in ggml_mem_ranges (#16412)
b6683
llama : fix shapes for bert/mpt q/k norm (#16409)
b6679
vulkan: Fix FA coopmat1 invalid array indexing (#16365)
When computing sinks, the cm1 shader was looping r from 0 to Br rather than to rows_per_thread. I must have copied this from the scalar path (where it is correct), and somehow it wasn't causing failures on current drivers.
b6676
vulkan: in flash attention, bounds check against nem1 (don't rely on …
b6673
test-barrier : do not use more threads than physically available (#16…
b6670
musa: update compile flags (#16265)
Signed-off-by: Xiaodong Ye <[email protected]>
b6668
ci: update vulkan ci (#16294)