Releases: giladgd/llama.cpp

b6795.1

19 Oct 15:02
6d172a7

b6795.1 Pre-release
  • fix: deduplicate and deprioritize Microsoft Direct3D12 vulkan devices from the vulkan-dozen driver

b4450

08 Jan 23:13
8d59d91

fix: add missing msg in static_assert (#11143)

Signed-off-by: hydai <[email protected]>
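For context: before C++17, `static_assert` requires a message string as its second argument, so omitting it fails to compile on older language standards. The snippet below is only a minimal sketch of the two-argument form; `SketchHeader` is a hypothetical type made up for illustration and is not taken from the commit.

```cpp
#include <cstdint>

// Hypothetical struct, used only to give the assertion something to check.
struct SketchHeader {
    std::uint32_t magic;
    std::uint32_t version;
};

// Two-argument form: accepted by C++11 and later.
// The one-argument form (no message) is only valid from C++17 onward.
static_assert(sizeof(SketchHeader) == 8, "SketchHeader must be exactly 8 bytes");
```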

b4404

02 Jan 02:29
0827b2c

ggml : fixes for AVXVNNI instruction set with MSVC and Clang (#11027)

* Fixes for clang AVX VNNI

* enable AVX VNNI and alder lake build for MSVC

* Apply suggestions from code review

---------

Co-authored-by: slaren <[email protected]>

b3932

17 Oct 00:48
2194200

fix: allocating CPU buffer with size `0` (#9917)
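Zero-size requests are a common edge case because `malloc(0)` may return either a null pointer or a unique pointer, depending on the implementation, so a null result cannot be read as "out of memory" by itself. The following is a rough sketch of one way to make the empty case explicit; `SketchBuffer` and `sketch_alloc_buffer` are hypothetical names, and this is not claimed to be what the commit does.

```cpp
#include <cstddef>
#include <cstdlib>

// Hypothetical buffer handle: `ok` distinguishes "empty" from "failed".
struct SketchBuffer {
    void *      data;
    std::size_t size;
    bool        ok;
};

// Assumption for this sketch: a size of 0 yields a valid, empty buffer
// without calling the allocator at all.
SketchBuffer sketch_alloc_buffer(std::size_t size) {
    if (size == 0) {
        return {nullptr, 0, true};
    }
    void * p = std::malloc(size);
    if (p == nullptr) {
        return {nullptr, 0, false};
    }
    return {p, size, true};
}

// Releases the memory (free(nullptr) is a no-op) and resets the handle.
void sketch_free_buffer(SketchBuffer & buf) {
    std::free(buf.data);
    buf = {nullptr, 0, false};
}
```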

b3931

17 Oct 00:11
73afe68

fix: use `vm_allocate` to allocate CPU backend buffer on macOS (#9875)

* fix: use `vm_allocate` to allocate CPU backend buffer on macOS

* fix: switch to `posix_memalign` so existing `free()` usages keep working

* feat: move `GGML_ALIGNED_MALLOC` to `ggml-backend-impl.h`, add support for `vm_allocate` on macOS

* style: formatting

* fix: move const outside of `#ifndef`

* style: formatting

* fix: unused var

* fix: transform `GGML_ALIGNED_MALLOC` and `GGML_ALIGNED_FREE` into functions and add them to `ggml-impl.h`

* fix: unused var

* fix: page align to `GGUF_DEFAULT_ALIGNMENT`

* fix: page align to `TENSOR_ALIGNMENT`

* fix: convert `TENSOR_ALIGNMENT` to a macro

* fix: increase page size to `32` on iOS

* fix: iOS page size

* fix: `hbw_posix_memalign` alignment
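The `posix_memalign` step in the list above relies on the fact that memory obtained from `posix_memalign` can be released with plain `free()`, which is why it lets existing `free()` call sites stay unchanged. A minimal sketch of that pattern on a POSIX system follows; the function names and the alignment value `kSketchAlignment` are assumptions for illustration, not the project's actual helpers or constants.

```cpp
#include <cstddef>
#include <cstdint>
#include <stdlib.h>  // posix_memalign (POSIX) and free

// Hypothetical alignment. posix_memalign requires a power of two
// that is also a multiple of sizeof(void *).
constexpr std::size_t kSketchAlignment = 64;

// Returns a pointer aligned to kSketchAlignment, or nullptr on failure.
void * sketch_aligned_malloc(std::size_t size) {
    void * ptr = nullptr;
    const int rc = posix_memalign(&ptr, kSketchAlignment, size);
    if (rc != 0) {
        return nullptr;  // rc is EINVAL or ENOMEM; errno is not set
    }
    return ptr;
}

// Memory from posix_memalign is released with the ordinary free().
void sketch_aligned_free(void * ptr) {
    free(ptr);
}
```

A caller would pair `sketch_aligned_malloc(n)` with `sketch_aligned_free(p)`, and can check the alignment with `reinterpret_cast<std::uintptr_t>(p) % kSketchAlignment == 0`.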

b2392

10 Mar 23:00
bb6d00b

metal : move mm_id indices to shared mem (#5982)