Skip to content

Releases: ngxson/llama.cpp

b4869

10 Mar 20:14
2c9f833
Compare
Choose a tag to compare
mat vec double buffer (#12188)

b4868

10 Mar 18:01
2513645
Compare
Choose a tag to compare
musa: support new arch mp_31 and update doc (#12296)

Signed-off-by: Xiaodong Ye <[email protected]>

b4867

10 Mar 17:40
8acdacb
Compare
Choose a tag to compare
opencl: use OpenCL C standard supported by the device (#12221)

This patch nudges the llama.cpp a bit to be supported on PoCL which
doesn't support OpenCL C CL2.0. The issue is solved by querying the
device for the supported OpenCL C versions and using the highest one
available.

b4865

10 Mar 12:48
e128a1b
Compare
Choose a tag to compare
tests : fix test-quantize-fns to init the CPU backend (#12306)

ggml-ci

b4864

10 Mar 12:28
6ef79a6
Compare
Choose a tag to compare
common : refactor '-o' option (#12278)

As discussed in PR 'llama-tts : add -o option' (#12042):

* common_params : 'out_file' string is the only output file name parameter left in common_params. It's intended to be used in all example programs implementing an '-o' option.

* cvector-generator, export-lora, imatrix : default output filenames moved from 'common_params' to the 'main()' of each example program.

b4863

10 Mar 11:43
4e39a3c
Compare
Choose a tag to compare
`server`: extract <think> tags from qwq outputs (#12297)

* extract <think> tags from qwq outputs

* const for all static regexes in chat.cpp

b4862

10 Mar 10:28
be421fc
Compare
Choose a tag to compare
`tool-call`: ensure there's always a non-empty tool call id (#12292)

b4859

10 Mar 09:16
8352cdc
Compare
Choose a tag to compare
llava : fix bug in minicpm-v code (#11513)

* fix bug in minicpm-v code

* update readme of minicpm-v

b4858

09 Mar 18:04
1e2f78a
Compare
Choose a tag to compare
server : add speculative decoding presets for FIM (#12287)

b4856

08 Mar 16:44
6fefc05
Compare
Choose a tag to compare
ggml-backend : make path_str compatible with C++20 (#12269)