Releases · bugparty/llama.cpp

18 Sep 21:51

3edd87c

b6515 Latest

Latest

opencl: optimize mxfp4 kernels (#16037)

- flatten mxfp4 and packed fp4->fp16 bit-wise convert function (replace lut)
- MoE kernel optimizations

---------

Co-authored-by: Li He <[email protected]>

Assets 15

cudart-llama-bin-win-cuda-12.4-x64.zip

sha256:8c79a9b226de4b3cacfd1f83d24f962d0773be79f1e7b75c6af4ded7e32ae1d6

373 MB 2025-09-18T21:51:54Z
llama-b6515-bin-macos-arm64.zip

sha256:79198c2fa02b83d400ef40cf5be6359017fc242b488524d524940fdc7bd9771a

10.2 MB 2025-09-18T21:52:04Z
llama-b6515-bin-macos-x64.zip

sha256:8d4ccedad9e8ee7972761c5968f043e40af2f641016472a48488b96a63f687b2

28.4 MB 2025-09-18T21:52:05Z
llama-b6515-bin-ubuntu-vulkan-x64.zip

sha256:e0ea15ede3a0623926d5d49a5cffaba86ab8a001232d7573509be83675a0e048

25.1 MB 2025-09-18T21:52:07Z
llama-b6515-bin-ubuntu-x64.zip

sha256:970d42f9a8763e4032bf5b3c1d130a5931eedffeebbea56d805ac583375464db

12.2 MB 2025-09-18T21:52:08Z
llama-b6515-bin-win-cpu-arm64.zip

sha256:ad3b63b7d8714eb3327f8aa801215e7cff6313fc4ae710f7a05e02eb0fa9b2c6

10.4 MB 2025-09-18T21:52:09Z
llama-b6515-bin-win-cpu-x64.zip

sha256:62adcd14066e440f9081c0c8cdf1e62ec9410733f7ca692f59d002d0bcb07650

13.4 MB 2025-09-18T21:52:10Z
llama-b6515-bin-win-cuda-12.4-x64.zip

sha256:ef137046affcd04012a35864cd4da34e0f19da0309d4140946a3c8ce419b4412

146 MB 2025-09-18T21:52:11Z
llama-b6515-bin-win-hip-radeon-x64.zip

sha256:43c5d0280a0e3cb04b5e976bfd4f1954f0ef994e937a1176132fbd7da22ed908

318 MB 2025-09-18T21:52:15Z
llama-b6515-bin-win-opencl-adreno-arm64.zip

sha256:71745994c5e13fe01b76ee64f8538a3fbda596887e92fc9abd512d1477f6b4dd

10.8 MB 2025-09-18T21:52:24Z
Source code (zip)

2025-09-18T19:03:34Z
Source code (tar.gz)

2025-09-18T19:03:34Z

16 Sep 21:31

github-actions

b6490

8ff2060

b6490

llama-bench: add --n-cpu-moe support (#15952)

* llama-bench: add --n-cpu-moe support

Support --n-cpu-moe in llama-bench the same way it is supported by
llama-server.

Assets 15

15 Sep 21:57

github-actions

b6479

b907255

b6479

SYCL: Add COUNT_EQUAL operator support (#15991)

* SYCL: Add COUNT_EQUAL operator support (rebased on master)

* SYCL: remove duplicate op_count_equal definition

* tests: remove test_count_equal_typed and use test_count_equal for all cases

* tests: keep only I32 case for COUNT_EQUAL as suggested

* tests: keep only I32 case for COUNT_EQUAL as requested

Assets 15

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Choose a tag to compare

Sorry, something went wrong.

Sorry, something went wrong.

Uh oh!

No results found

Uh oh!

Choose a tag to compare

Sorry, something went wrong.

Sorry, something went wrong.

Uh oh!

No results found

Uh oh!

Choose a tag to compare

Sorry, something went wrong.

Sorry, something went wrong.

Uh oh!

No results found

Uh oh!

Releases: bugparty/llama.cpp

b6515

Uh oh!

b6490

Uh oh!

b6479

Uh oh!