Releases: ggml-org/llama.cpp
b5028
llama : add option to override model tensor buffers (#11397)
* llama : add option to override tensor buffers
* ggml : fix possible underflow in ggml_nbytes
b5026
vocab : BailingMoE : change possessive quantifiers to greedy (#12677)
b5025
common : remove json.hpp from common.cpp (#12697)
* common : remove json.hpp from common.cpp
* fix comment
b5022
opencl : fix memory allocation size (#12649)
issue: https://github.com/CodeLinaro/llama.cpp/pull/17#issuecomment-2760611283
This patch ensures that the memory allocation size does not exceed the maximum allocation size of the OpenCL device.
b5021
llama : use LLM_KV_GENERAL_FILE_TYPE instead of gguf_find_key (#12672)
b5019
metal : use F32 prec in FA kernels (#12688)
* metal : use F32 prec in FA kernels
* cont : fix FA vec kernel
b5018
Fix clang warning in gguf_check_reserved_keys (#12686)
* Fix clang warning in gguf_check_reserved_keys
* Fix typo
Signed-off-by: Xiaodong Ye <[email protected]>
b5017
vulkan: fix build when glslc doesn't support coopmat (#12683)
b5016
SYCL: Rename oneMKL to oneMath (#12192)
* Rename oneMKL Interface to oneMath
* Use oneMath for Intel vendor
* Rename occurrences to mkl
* clang-format
* Silence verbose warnings
* Set oneMath HIP_TARGETS
* Fix silence warnings
* Remove step to build oneMath from build instructions
* Use fixed oneMath version
* Remove INTEL_CPU
* Fold CMake oneDNN conditions
* Use Intel oneMKL for Intel devices
* Improve CMake message
* Link against MKL::MKL_SYCL::BLAS only
* Move oneMath documentation to Nvidia and AMD sections
b5015
SYCL: switch to SYCL namespace (#12674)