Releases · giladgd/llama.cpp

19 Oct 15:02

giladgd

b6795.1

6d172a7

b6795.1 Pre-release

fix: deduplicate and deprioritize Microsoft Direct3D12 vulkan devices from the vulkan-dozen driver

Assets 2

08 Jan 23:13

github-actions

b4450

8d59d91

b4450 Latest

fix: add missing msg in static_assert (#11143)

Signed-off-by: hydai

Assets 23

cudart-llama-bin-win-cu11.7-x64.zip, 303 MB, 2025-01-08T23:13:53Z
cudart-llama-bin-win-cu12.4-x64.zip, 373 MB, 2025-01-08T23:14:08Z
llama-b4450-bin-macos-arm64.zip, 12.6 MB, 2025-01-08T23:14:18Z
llama-b4450-bin-macos-x64.zip, 13.6 MB, 2025-01-08T23:14:19Z
llama-b4450-bin-ubuntu-x64.zip, 15.4 MB, 2025-01-08T23:14:20Z
llama-b4450-bin-win-avx-x64.zip, 9.81 MB, 2025-01-08T23:14:21Z
llama-b4450-bin-win-avx2-x64.zip, 9.81 MB, 2025-01-08T23:14:22Z
llama-b4450-bin-win-avx512-x64.zip, 9.83 MB, 2025-01-08T23:14:22Z
llama-b4450-bin-win-cuda-cu11.7-x64.zip, 147 MB, 2025-01-08T23:14:24Z
llama-b4450-bin-win-cuda-cu12.4-x64.zip, 147 MB, 2025-01-08T23:14:30Z
Source code (zip), 2025-01-08T20:03:28Z
Source code (tar.gz), 2025-01-08T20:03:28Z

02 Jan 02:29

github-actions

b4404

0827b2c

b4404

ggml : fixes for AVXVNNI instruction set with MSVC and Clang (#11027)

* Fixes for clang AVX VNNI

* enable AVX VNNI and alder lake build for MSVC

* Apply suggestions from code review

---------

Co-authored-by: slaren

Assets 23

17 Oct 00:48

github-actions

b3932

2194200

b3932

fix: allocating CPU buffer with size `0` (#9917)

Assets 22

17 Oct 00:11

github-actions

b3931

73afe68

b3931

fix: use `vm_allocate` to allocate CPU backend buffer on macOS (#9875)

* fix: use `vm_allocate` to allocate CPU backend buffer on macOS

* fix: switch to `posix_memalign` to keep existing `free()` usages working

* feat: move `GGML_ALIGNED_MALLOC` to `ggml-backend-impl.h`, add support for `vm_allocate` on macOS

* style: formatting

* fix: move const outside of `#ifndef`

* style: formatting

* fix: unused var

* fix: transform `GGML_ALIGNED_MALLOC` and `GGML_ALIGNED_FREE` into functions and add them to `ggml-impl.h`

* fix: unused var

* fix: page align to `GGUF_DEFAULT_ALIGNMENT`

* fix: page align to `TENSOR_ALIGNMENT`

* fix: convert `TENSOR_ALIGNMENT` to a macro

* fix: increase page size to `32` on iOS

* fix: iOS page size

* fix: `hbw_posix_memalign` alignment
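
The commits above mention switching to `posix_memalign` so that existing `free()` calls keep working. A minimal sketch of that general technique follows; it is not the ggml implementation. The names `sketch_aligned_malloc`, `sketch_aligned_free`, and `SKETCH_ALIGNMENT`, the 32-byte alignment value, and the choice to return `NULL` for a zero-size request are all assumptions made for illustration.

```c
#define _POSIX_C_SOURCE 200112L  /* expose posix_memalign on strict compilers */
#include <assert.h>
#include <stdint.h>
#include <stdlib.h>

/* Hypothetical alignment for this sketch. The release notes refer to
   TENSOR_ALIGNMENT but do not state its value, so 32 is an assumption. */
#define SKETCH_ALIGNMENT 32

/* Allocate `size` bytes aligned to SKETCH_ALIGNMENT.
   Returns NULL on failure. Returning NULL for size 0 is a choice made
   by this sketch, not behavior taken from the release notes. */
void *sketch_aligned_malloc(size_t size) {
    /* posix_memalign requires a power-of-two alignment that is also a
       multiple of sizeof(void *). */
    assert((SKETCH_ALIGNMENT & (SKETCH_ALIGNMENT - 1)) == 0);
    assert(SKETCH_ALIGNMENT % sizeof(void *) == 0);

    if (size == 0) {
        return NULL;
    }

    void *ptr = NULL;
    if (posix_memalign(&ptr, SKETCH_ALIGNMENT, size) != 0) {
        return NULL;
    }
    return ptr;
}

/* Memory obtained from posix_memalign is released with plain free(). */
void sketch_aligned_free(void *ptr) {
    free(ptr);
}
```

A caller would pair `sketch_aligned_malloc(n)` with `sketch_aligned_free(p)`, or with `free(p)` directly, which is the property the commit message relies on.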

Assets 22

10 Mar 23:00

github-actions

b2392

bb6d00b

b2392

metal : move mm_id indices to shared mem (#5982)

Assets 14
