Releases: alitariq4589/llama.cpp
b6191
b6140
HIP: disable sync warp shuffle operators from clr amd_warp_sync_funct…
b5774
CUDA: add bf16 and f32 support to cublas_mul_mat_batched (#14361)
* CUDA: add bf16 and f32 support to cublas_mul_mat_batched
* Review: add type traits and make function more generic
* Review: make check more explicit, add back comments, and fix formatting
* Review: fix formatting, remove useless type conversion, fix naming for bools
b5749
main : honor --verbose-prompt on interactive prompts (#14350)