Releases: Gadflyii/llama.cpp

b6736

11 Oct 23:33
11f0af5

CUDA: faster tile FA, add oob checks, more HSs (#16492)

b6729

10 Oct 19:02
81086cd

vocab : mark EOT token for Granite models (#16499)

* vocab : mark EOT token for Granite models

* sampling : fallback to EOS when EOT is not found

b6725

09 Oct 16:48
a6cb7c8

Merge pull request #9 from ggml-org/master

Merge from upstream

b6708

07 Oct 11:30
dec0c8d

Merge pull request #8 from ggml-org/master

merge from upstream

b6700

06 Oct 22:34
3df2244

llama : add --no-host to disable host buffers (#16310)

* implement --no-host to disable host buffer

* fix equal_mparams

* move no-host enumeration order together with other model params

---------

Co-authored-by: slaren <[email protected]>

b6647

30 Sep 15:46

implement --no-host to disable host buffer

b6646

30 Sep 15:37
364a7a6

common : remove common_has_curl() (#16351)

`test-arg-parser.cpp` has been updated to work consistently,
regardless of whether CURL or SSL support is available, and
now always points to `ggml.ai`.

The previous timeout test has been removed, but it can be
added back by providing a dedicated URL under `ggml.ai`.

Signed-off-by: Adrien Gallouët <[email protected]>

b6649

29 Sep 17:37
4b224c1

Merge branch 'ggml-org:master' into master