Releases: ngxson/llama.cpp
b4881
llama : refactor llama_context, llama_kv_cache, llm_build_context (#1…
b4880
server : fix crash when using verbose output with input tokens that a…
b4879
Update build.yml for Windows Vulkan builder to use Vulkan 1.4.304 SDK…
b4877
sycl : variable sg_size support for mmvq kernels (#12336)
b4876
CUDA/HIP: Fix fattn-vec-* when device warp size is not 32 (#12315)
When fattn-wmma was ported over to warp64, various bits that also touch fattn-vec were converted to a selectable warp size. However, the fattn-vec kernels don't work with 64-wide warps for now, so we need to avoid launching them with parameters for warp64.
b4875
llama : Add Gemma 3 support (+ experimental vision capability) (#12343)
* llama : Add Gemma 3 text-only support
* fix python coding style
* fix compile on ubuntu
* python: fix style
* fix ubuntu compile
* fix build on ubuntu (again)
* fix ubuntu build, finally
* clip : Experimental support for Gemma 3 vision (#12344)
* clip : Experimental support for Gemma 3 vision
* fix build
* PRId64
b4874
vulkan: fix bug in coopmat1 mul_mat_id (#12316)
* tests: run mul_mat_id with a larger N
* vulkan: fix bug in coopmat1 mul_mat_id
b4873
CUDA/HIP: refactor mmqv to unify the calculation of nwarps and rows …
b4872
ggml-backend : fix backend search path (#12330)
* Fix backend search path
* replace .native() with '/'
* reverted .native()
b4871
metal : Cache the Metal library at the device context level (#12265)