Releases: standby24x7/llama_fix.cpp
b3499
cann: support q8_0 for Ascend backend (#8805)
b3486
[SYCL] add conv support (#8688)
b3432
flake.lock: Update (#8610)
b3416
CUDA: fix partial offloading for ne0 % 256 != 0 (#8572)
b3398
ggml : suppress unknown pragma 'GCC' on windows (#8460)

This commit adds a macro guard to pragma GCC to avoid the following warning on windows:

```console
C:\llama.cpp\ggml\src\ggml-aarch64.c(17,9): warning C4068: unknown pragma 'GCC' [C:\lama.cpp\build\ggml\src\ggml.vcxproj]
```
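
As a rough illustration of the macro guard described in the b3398 note, the sketch below wraps a `#pragma GCC` line in a `__GNUC__` check, so that MSVC never sees the pragma and has no reason to emit warning C4068. The specific diagnostic named in the pragma and the helper `gcc_pragma_enabled` are assumptions made for this example and are not taken from the release text.

```c
/* Minimal sketch of a macro guard around a GCC-specific pragma.
 * Assumption: the diagnostic being silenced is illustrative only. */
#if defined(__GNUC__)
/* Seen only by compilers that define __GNUC__ (GCC, Clang). */
#pragma GCC diagnostic ignored "-Woverlength-strings"
#endif

/* Hypothetical helper: returns 1 when the guarded pragma was compiled in,
 * 0 when the compiler (e.g. MSVC) skipped it. */
int gcc_pragma_enabled(void) {
#if defined(__GNUC__)
    return 1;
#else
    return 0;
#endif
}
```

With a guard of this kind, compilers that do not define `__GNUC__` skip the pragma entirely, while GCC-compatible compilers keep the original behavior.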
b3372
gitignore : deprecated binaries
b3368
Name Migration: Build the deprecation-warning 'main' binary every tim…
b3358
Server: Enable setting default sampling parameters via command-line (…
b3346
scripts : fix sync for sycl