Releases · standby24x7/llama_fix.cpp

26 May 13:46

79c137f

b5493 Latest

examples : allow extracting embeddings from decoder contexts (#13797)

ggml-ci

Assets 18

cudart-llama-bin-win-cuda-11.7-x64.zip    303 MB    2025-05-26T13:46:29Z

cudart-llama-bin-win-cuda-12.4-x64.zip    373 MB    2025-05-26T13:46:38Z

llama-b5493-bin-macos-arm64.zip    10.8 MB    2025-05-26T13:46:49Z

llama-b5493-bin-macos-x64.zip    26 MB    2025-05-26T13:46:49Z

llama-b5493-bin-ubuntu-arm64.zip    11.7 MB    2025-05-26T13:46:51Z

llama-b5493-bin-ubuntu-vulkan-x64.zip    20 MB    2025-05-26T13:46:51Z

llama-b5493-bin-ubuntu-x64.zip    12.3 MB    2025-05-26T13:46:52Z

llama-b5493-bin-win-cpu-arm64.zip    11.1 MB    2025-05-26T13:46:53Z

llama-b5493-bin-win-cpu-x64.zip    13.7 MB    2025-05-26T13:46:54Z

llama-b5493-bin-win-cuda-11.7-x64.zip    108 MB    2025-05-26T13:46:55Z

Source code (zip)    2025-05-26T11:03:54Z

Source code (tar.gz)    2025-05-26T11:03:54Z

22 Apr 14:43

github-actions

b5168

ab47dec

b5168

security : add note about RPC and server functionality (#13061)

* security : add note about RPC functionality

* security : add note about llama-server

Assets 26

16 Feb 02:13

github-actions

b4726

6dde178

b4726

scripts: fix compare-llama-bench commit hash logic (#11891)

Assets 24

15 Feb 18:08

github-actions

b4722

68ff663

b4722

repo : update links to new url (#11886)

* repo : update links to new url

ggml-ci

* cont : more urls

ggml-ci

Assets 24

18 Nov 11:43

github-actions

b4122

9b75f03

b4122

Vulkan: Fix device info output format specifiers (#10366)

* Vulkan: Fix device info output format specifiers

* Vulkan: Use zu printf specifier for size_t instead of ld
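The specifier change in this release can be illustrated with a minimal sketch. It is not the code from the Vulkan backend: the helper name `format_mem_size` and the message text are hypothetical, chosen only to show why `%zu` is the portable conversion for `size_t`. `%ld` expects a `long`, which is 32 bits on 64-bit Windows while `size_t` is 64 bits there, so passing a `size_t` to `%ld` is undefined behavior.

```c
#include <stddef.h>
#include <stdio.h>

/* Hypothetical helper (not from the llama.cpp sources): formats a byte
 * count into buf and returns the number of characters snprintf would
 * have written, excluding the terminating NUL. */
static int format_mem_size(char *buf, size_t buf_len, size_t mem_bytes) {
    /* %zu is the C99 conversion for size_t. Using %ld here would
     * mismatch the argument type on platforms where long and size_t
     * differ in width. */
    return snprintf(buf, buf_len, "device memory: %zu bytes", mem_bytes);
}
```

Calling `format_mem_size(buf, sizeof buf, (size_t)1024)` with a 64-byte buffer fills it with `device memory: 1024 bytes`.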

Assets 21

07 Nov 16:20

github-actions

b4041

2319126

b4041

fix q4_0_8_8 format for corrupted tokens issue (#10198)

Co-authored-by: EC2 Default User <[email protected]>

Assets 22

08 Oct 12:44

github-actions

b3898

458367a

b3898

server : better security control for public deployments (#9776)

* server : more explicit endpoint access settings

* protect /props endpoint

* fix tests

* update server docs

* fix typo

* fix tests

Assets 22

26 Sep 12:31

github-actions

b3828

95bc82f

b3828

[SYCL] add missed dll file in package (#9577)

* update oneapi to 2024.2

* use 2024.1

---------

Co-authored-by: arthw <[email protected]>

Assets 22

24 Aug 08:39

github-actions

b3619

8f824ff

b3619

quantize : fix typo in usage help of `quantize.cpp` (#9145)

Assets 19

11 Aug 05:59

github-actions

b3565

6e02327

b3565

metal : fix uninitialized abort_callback (#8968)

Assets 20
