Skip to content

Releases: standby24x7/llama_fix.cpp

b5493

26 May 13:46
79c137f

Choose a tag to compare

examples : allow extracting embeddings from decoder contexts (#13797)

ggml-ci

b5168

22 Apr 14:43
ab47dec

Choose a tag to compare

security : add note about RPC and server functionality (#13061)

* security : add note about RPC functionality

* security : add note about llama-server

b4726

16 Feb 02:13
6dde178

Choose a tag to compare

scripts: fix compare-llama-bench commit hash logic (#11891)

b4722

15 Feb 18:08
68ff663

Choose a tag to compare

repo : update links to new url (#11886)

* repo : update links to new url

ggml-ci

* cont : more urls

ggml-ci

b4122

18 Nov 11:43
9b75f03

Choose a tag to compare

Vulkan: Fix device info output format specifiers (#10366)

* Vulkan: Fix device info output format specifiers

* Vulkan: Use zu printf specifier for size_t instead of ld

b4041

07 Nov 16:20
2319126

Choose a tag to compare

fix q4_0_8_8 format for corrupted tokens issue (#10198)

Co-authored-by: EC2 Default User <[email protected]>

b3898

08 Oct 12:44
458367a

Choose a tag to compare

server : better security control for public deployments (#9776)

* server : more explicit endpoint access settings

* protect /props endpoint

* fix tests

* update server docs

* fix typo

* fix tests

b3828

26 Sep 12:31
95bc82f

Choose a tag to compare

[SYCL] add missed dll file in package (#9577)

* update oneapi to 2024.2

* use 2024.1

---------

Co-authored-by: arthw <[email protected]>

b3619

24 Aug 08:39
8f824ff

Choose a tag to compare

quantize : fix typo in usage help of `quantize.cpp` (#9145)

b3565

11 Aug 05:59
6e02327

Choose a tag to compare

metal : fix uninitialized abort_callback (#8968)