Skip to content

Releases: ServeurpersoCom/llama.cpp

b6689

04 Oct 16:52
f392839
Compare
Choose a tag to compare
rpc : check src buffer when copying tensor (#16421)

Only dst buffer is guaranteed to be an RPC buffer. Add check for the src
one.

b6688

04 Oct 11:26
898acba
Compare
Choose a tag to compare
rpc : add support for multiple devices (#16276)

* rpc : add support for multiple devices

Allow rpc-server to expose multiple devices from a single endpoint.
Change RPC protocol to include device identifier where needed.

closes: #15210

* fixes

* use ggml_backend_reg_t

* address review comments

* fix llama-bench backend report

* address review comments, change device naming

* fix cmd order

b6686

03 Oct 19:27
128d522
Compare
Choose a tag to compare
chat : support Magistral thinking (#16413)

* feat: added a dedicated Magistral chat format that preserves [THINK] spans, parses reasoning before tool calls

* feat: new flow in the chat template test suite for Magistral

b6684

03 Oct 16:55
606a73f
Compare
Choose a tag to compare
metal : fix loop bound in ggml_mem_ranges (#16412)

b6683

03 Oct 13:37
946f71e
Compare
Choose a tag to compare
llama : fix shapes for bert/mpt q/k norm (#16409)

b6679

03 Oct 10:11
0e1f838
Compare
Choose a tag to compare
vulkan: Fix FA coopmat1 invalid array indexing (#16365)

When computing sinks, the cm1 shader was looping r from 0 to Br rather than
to rows_per_thread. I must have copied this from the scalar path (where it is
correct), and somehow it wasn't causing failures on current drivers.

b6676

03 Oct 09:03
e308efd
Compare
Choose a tag to compare
vulkan: in flash attention, bounds check against nem1 (don't rely on …

b6673

02 Oct 20:19
d64c810
Compare
Choose a tag to compare
test-barrier : do not use more threads than physically available (#16…

b6670

02 Oct 14:20
91a2a56
Compare
Choose a tag to compare
musa: update compile flags (#16265)

Signed-off-by: Xiaodong Ye <[email protected]>

b6668

02 Oct 09:27
f09aefa
Compare
Choose a tag to compare
ci: update vulkan ci (#16294)