Releases: withcatai/node-llama-cpp
v3.14.0
3.14.0 (2025-10-02)
Features
- Qwen3 Reranker support (#506) (00305f7) (see #506 for prequantized Qwen3 Reranker models you can use)
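For reference, a minimal usage sketch of reranking with this support, based on the documented ranking API (`createRankingContext` and `rankAndSort`); the model file name is a placeholder, see #506 for actual prequantized Qwen3 Reranker models:

```ts
import {getLlama} from "node-llama-cpp";

const llama = await getLlama();

// placeholder path; see #506 for prequantized Qwen3 Reranker GGUF files
const model = await llama.loadModel({modelPath: "qwen3-reranker.gguf"});
const context = await model.createRankingContext();

const query = "Tell me a geology fact";
const documents = [
    "The Earth's crust is made up of tectonic plates",
    "Mount Everest is the tallest mountain in the world",
    "A piano has 88 keys"
];

// scores each document against the query and returns them sorted by relevance
const ranked = await context.rankAndSort(query, documents);
console.log(ranked[0]); // the most relevant document, with its score
```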
Bug Fixes
- handle HuggingFace rate limit responses (#506) (00305f7)
- adapt to `llama.cpp` breaking changes (#506) (00305f7)
Shipped with `llama.cpp` release `b6673`.
To use the latest `llama.cpp` release available, run `npx -n node-llama-cpp source download --release latest`. (learn more)
v3.13.0
3.13.0 (2025-09-09)
Features
Bug Fixes
- adapt to breaking `llama.cpp` changes (#501) (76b505e)
- Vulkan: read external memory usage (#500) (d33cc31)
Shipped with `llama.cpp` release `b6431`.
To use the latest `llama.cpp` release available, run `npx -n node-llama-cpp source download --release latest`. (learn more)
v3.12.4
✨ `gpt-oss` is here! ✨
Read about the release in the blog post.
3.12.4 (2025-08-28)
Bug Fixes
Shipped with `llama.cpp` release `b6301`.
To use the latest `llama.cpp` release available, run `npx -n node-llama-cpp source download --release latest`. (learn more)
v3.12.3
✨ `gpt-oss` is here! ✨
Read about the release in the blog post.
3.12.3 (2025-08-26)
Bug Fixes
- Vulkan: context creation edge cases (#492) (12749c0)
- prebuilt binaries CUDA 13 support (#494) (b10999d)
- don't share loaded shared libraries between backends (#492) (12749c0)
- split prebuilt CUDA binaries into 2 npm modules (#495) (6e59160)
Shipped with `llama.cpp` release `b6294`.
To use the latest `llama.cpp` release available, run `npx -n node-llama-cpp source download --release latest`. (learn more)
v3.12.1
✨ `gpt-oss` is here! ✨
Read about the release in the blog post.
3.12.1 (2025-08-11)
Features
- `comment` segment budget (#489) (30eaa23) (documentation: API: `LLamaChatPromptOptions["budgets"]["commentTokens"]`) (usage sketch below)
- Electron template: comment segments
- Electron template: improve completions speed when using functions
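A minimal sketch of the new budget option, assuming a placeholder model path; see the linked `LLamaChatPromptOptions["budgets"]["commentTokens"]` documentation for the exact semantics:

```ts
import {getLlama, LlamaChatSession} from "node-llama-cpp";

const llama = await getLlama();
const model = await llama.loadModel({modelPath: "model.gguf"}); // placeholder path
const context = await model.createContext();
const session = new LlamaChatSession({contextSequence: context.getSequence()});

const answer = await session.prompt("Explain how quicksort works", {
    budgets: {
        commentTokens: 100 // cap the tokens the model may spend on comment segments
    }
});
console.log(answer);
```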
Bug Fixes
- `gpt-oss` segment budgets (#489) (30eaa23)
- add support for more `gpt-oss` variations (#489) (30eaa23)
- default to using a model message for prompt completion on unsupported models (#489) (30eaa23)
- prompt completion config (#490) (f849cd9)
Shipped with `llama.cpp` release `b6133`.
To use the latest `llama.cpp` release available, run `npx -n node-llama-cpp source download --release latest`. (learn more)
v3.12.0
✨ `gpt-oss` is here! ✨
Read about the release in the blog post.
3.12.0 (2025-08-09)
Features
Bug Fixes
Shipped with `llama.cpp` release `b6122`.
To use the latest `llama.cpp` release available, run `npx -n node-llama-cpp source download --release latest`. (learn more)
v3.11.0
3.11.0 (2025-07-29)
Features
- NUMA policy (#482) (a2ddaa2) (documentation: API: `LlamaOptions["numa"]`) (usage sketch below)
- `inspect gpu` command: log prebuilt binaries and cloned source releases (#482) (a2ddaa2)
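A minimal sketch of opting into a NUMA policy when loading the bindings; `"distribute"` is an assumed value mirroring llama.cpp's NUMA strategies, so check the linked `LlamaOptions["numa"]` documentation for the accepted values:

```ts
import {getLlama} from "node-llama-cpp";

// "distribute" is an assumed value mirroring llama.cpp's NUMA strategies;
// see the LlamaOptions["numa"] documentation for the full list
const llama = await getLlama({numa: "distribute"});
```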
Bug Fixes
- add missing GGUF metadata types (#482) (a2ddaa2)
- level of some internal logs (#482) (a2ddaa2)
- JSON schema grammar edge case (#482) (a2ddaa2)
Shipped with `llama.cpp` release `b6018`.
To use the latest `llama.cpp` release available, run `npx -n node-llama-cpp source download --release latest`. (learn more)
v3.10.0
3.10.0 (2025-06-12)
Features
- JSON Schema Grammar: `$defs` and `$ref` support with full inferred types (#472) (9cdbce9) (usage sketch below)
- `inspect gguf` command: format and print the Jinja chat template with `--key .chatTemplate` (#472) (9cdbce9)
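A minimal sketch of the `$defs`/`$ref` support via the existing `createGrammarForJsonSchema` API; the model path is a placeholder:

```ts
import {getLlama, LlamaChatSession} from "node-llama-cpp";

const llama = await getLlama();
const grammar = await llama.createGrammarForJsonSchema({
    $defs: {
        person: {
            type: "object",
            properties: {
                name: {type: "string"},
                age: {type: "number"}
            }
        }
    },
    type: "object",
    properties: {
        author: {$ref: "#/$defs/person"} // resolved, with full inferred types
    }
});

const model = await llama.loadModel({modelPath: "model.gguf"}); // placeholder path
const context = await model.createContext();
const session = new LlamaChatSession({contextSequence: context.getSequence()});

const res = await session.prompt("Describe the author of Hamlet", {grammar});
const parsed = grammar.parse(res); // typed according to the schema, $ref included
```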
Bug Fixes
- `JinjaTemplateChatWrapper`: first function call prefix detection (#472) (9cdbce9)
- `QwenChatWrapper`: improve Qwen chat template detection (#472) (9cdbce9)
- apply `maxTokens` on function calling parameters (#472) (9cdbce9)
- adjust default prompt completion length based on SWA size when relevant (#472) (9cdbce9)
- improve thought segmentation syntax extraction (#472) (9cdbce9)
- adapt to `llama.cpp` changes (#472) (9cdbce9)
Shipped with `llama.cpp` release `b5640`.
To use the latest `llama.cpp` release available, run `npx -n node-llama-cpp source download --release latest`. (learn more)
v3.9.0
3.9.0 (2025-06-04)
Features
- reasoning budget (#468) (ea8d904) (documentation: Set Reasoning Budget) (usage sketch below)
- SWA (Sliding Window Attention) support: greatly reduced context memory consumption on supported models (#468) (ea8d904)
- documentation: LLM-friendly `llms.md` and `llms-full.md` files (#468) (ea8d904)
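A minimal sketch of setting a reasoning budget on a prompt, following the linked "Set Reasoning Budget" documentation; the model path is a placeholder:

```ts
import {getLlama, LlamaChatSession} from "node-llama-cpp";

const llama = await getLlama();
const model = await llama.loadModel({modelPath: "model.gguf"}); // placeholder path
const context = await model.createContext();
const session = new LlamaChatSession({contextSequence: context.getSequence()});

const answer = await session.prompt("Solve: 23 * 17", {
    budgets: {
        thoughtTokens: 256 // cap the tokens the model may spend on reasoning
    }
});
console.log(answer);
```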
Bug Fixes
Shipped with `llama.cpp` release `b5590`.
To use the latest `llama.cpp` release available, run `npx -n node-llama-cpp source download --release latest`. (learn more)
v3.8.1
3.8.1 (2025-05-19)
Bug Fixes
- `getLlamaGpuTypes`: edge case (#463) (1799127) (see the sketch below)
- remove prompt completion from the cached context window (#463) (1799127)
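For context, a sketch of the function the first fix touches; the `"supported"` filter argument is an assumption here, so consult the `getLlamaGpuTypes` API documentation:

```ts
import {getLlamaGpuTypes} from "node-llama-cpp";

// "supported" is an assumed filter value; see the getLlamaGpuTypes API docs
const gpuTypes = await getLlamaGpuTypes("supported");
console.log(gpuTypes); // e.g. ["cuda", "vulkan"] or ["metal"]
```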
Shipped with `llama.cpp` release `b5415`.
To use the latest `llama.cpp` release available, run `npx -n node-llama-cpp source download --release latest`. (learn more)