Releases: AD2605/llama.cpp
Releases · AD2605/llama.cpp
b6617
devops: switch to using ubuntu-22.04-s390x image (#16302) Signed-off-by: Aaron Teo <[email protected]>
b6317
chat : Seed OSS thinking + tool call support (#15552) * Reasoning and tool-calling support for Seed OSS * Fix grammar and partial parsing * Whitespace * New chat template * Update common/chat.cpp Co-authored-by: Sigbjørn Skjæret <[email protected]> * Update common/chat.cpp Co-authored-by: Sigbjørn Skjæret <[email protected]> * Remove unused 'purge_healing_marker' helper --------- Co-authored-by: Sigbjørn Skjæret <[email protected]>
b5854
ggml : prevent integer overflow in gguf tensor size calculation (#14595)
b5795
CANN: update aclnnGroupedMatmulV2 to aclnnGroupedMatmulV3 (#14411) * [CANN]update to aclnnGroupedMatmulV2 Signed-off-by: noemotiovon <[email protected]> * Support MUL_MAT_ID on 310p Signed-off-by: noemotiovon <[email protected]> * fix editorconfig Signed-off-by: noemotiovon <[email protected]> --------- Signed-off-by: noemotiovon <[email protected]>
b5787
Add Conv2d for CPU (#14388) * Conv2D: Add CPU version * Half decent * Tiled approach for F32 * remove file * Fix tests * Support F16 operations * add assert about size * Review: further formatting fixes, add assert and use CPU version of fp32->fp16
b5753
opencl: ref count `ggml_backend_opencl_context` and refactor profilin…
b5716
ggml : fix repack work size for mul_mat_id (#14292) ggml-ci
b5688
ggml-cpu : remove the weak alias trick (#14221)
b5611
webui: fix sidebar being covered by main content (#14082) * webui: fix sidebar being covered by main content Signed-off-by: Xiaodong Ye <[email protected]> * webui: update index.html.gz Signed-off-by: Xiaodong Ye <[email protected]> --------- Signed-off-by: Xiaodong Ye <[email protected]>
b5518
convert : fix tensor naming conflict for llama 4 vision (#13836) * convert : fix tensor naming conflict for llama 4 vision * add comment