Releases: Xarbirus/llama.cpp
Releases · Xarbirus/llama.cpp
b2094
fix `error C2078: too many initializers` with uint32x4_t for MSVC ARM64
b2093
CUDA: fixed mmvq kernel for bs 2,3,4 and -sm row (#5386)
b1354
sync : ggml (ggml-backend) (#3548) * sync : ggml (ggml-backend) ggml-ci * zig : add ggml-backend to the build