Releases: linehill/llama.cpp
Releases · linehill/llama.cpp
b6135
musa: fix failures in test-backend-ops for mul_mat_id op (#15236) * musa: fix failures in test-backend-ops for mul_mat_id op Signed-off-by: Xiaodong Ye <[email protected]> * Address review comments Signed-off-by: Xiaodong Ye <[email protected]> --------- Signed-off-by: Xiaodong Ye <[email protected]>
b5750
server : move no API key doc to /health (#14352)
b5681
HIP: disable rocwmma on gfx12 by default until rocm 7.0 (#14202)
b4984
server : include speculative decoding stats when timings_per_token is…