Releases: ochafik/llama.cpp

b4155

24 Nov 02:00
96fa2c5

fix gguf-py: Conversion error when multiple licenses are configured …

b3995

30 Oct 22:30
61408e7

kompute: add backend registry / device interfaces (#10045)

Get in line with the other backends by supporting the newer
backend/device registry interfaces.

Signed-off-by: Sergio Lopez <[email protected]>

b3987

28 Oct 22:31
61715d5

llama : Add IBM granite template (#10013)

* Add granite template to llama.cpp

* Add granite template to test-chat-template.cpp

* Update src/llama.cpp

Co-authored-by: Xuan Son Nguyen <[email protected]>

* Update tests/test-chat-template.cpp

Co-authored-by: Xuan Son Nguyen <[email protected]>

* Added proper template and expected output

* Small change to \n

* Add code space &

Co-authored-by: Xuan Son Nguyen <[email protected]>

* Fix spacing

* Apply suggestions from code review

* Update src/llama.cpp

---------

Co-authored-by: Xuan Son Nguyen <[email protected]>

b3963

23 Oct 11:22

flake.lock: Update

Flake lock file updates:

• Updated input 'nixpkgs':
    'github:NixOS/nixpkgs/5633bcff0c6162b9e4b5f1264264611e950c8ec7?narHash=sha256-9UTxR8eukdg%2BXZeHgxW5hQA9fIKHsKCdOIUycTryeVw%3D' (2024-10-09)
  → 'github:NixOS/nixpkgs/4c2fcb090b1f3e5b47eaa7bd33913b574a11e0a0?narHash=sha256-/uilDXvCIEs3C9l73JTACm4quuHUsIHcns1c%2BcHUJwA%3D' (2024-10-18)

b3958

22 Oct 11:44
4ff7fe1

llama : add chat template for RWKV-World + fix EOT (#9968)

* Add chat template for RWKV-World

Signed-off-by: Molly Sophia <[email protected]>

* RWKV: Fix the chat template not being used

Signed-off-by: Molly Sophia <[email protected]>

* RWKV v6: Set EOT token to ``\n\n``

Signed-off-by: Molly Sophia <[email protected]>

* readme: add rwkv into supported model list

Signed-off-by: Molly Sophia <[email protected]>

---------

Signed-off-by: Molly Sophia <[email protected]>