improve binary compatibility testing on Electron apps (#386) (97abbca)
too many abort signal listeners (#386) (97abbca)
log level of some lower level logs (#386) (97abbca)
context window missing response during generation on specific extreme conditions (#386) (97abbca)
adapt to breaking llama.cpp changes (#386) (97abbca)
automatically resolve compiler is out of heap space CUDA build error (#386) (97abbca)

Features

Llama 3.2 3B function calling support (#386) (97abbca)
use llama.cpp backend registry for GPUs instead of custom implementations (#386) (97abbca)
getLlama: build: "try" option (#386) (97abbca)
init command: --model flag (#386) (97abbca)
JSON Schema grammar: array prefixItems, minItems, maxItems support (#388) (4d387de)
JSON Schema grammar: object additionalProperties, minProperties, maxProperties support (#388) (4d387de)
JSON Schema grammar: string minLength, maxLength, format support (#388) (4d387de)
JSON Schema grammar: improve inferred types (#388) (4d387de)
function calling: params description support (#388) (4d387de)
function calling: document JSON Schema type properties on Functionary chat function types (#388) (4d387de)

Shipped with llama.cpp release b4234

To use the latest llama.cpp release available, run npx -n node-llama-cpp source download --release latest. (learn more)

Assets 16

0 Join discussion

31 Oct 01:39

github-actions

v3.2.0

6405ee9

v3.2.0

3.2.0 (2024-10-31)

Bug Fixes

Electron crash with some models on macOS when not using Metal (#375) (ea12dc5)
adapt to llama.cpp breaking changes (#375) (ea12dc5)
support rejectattr in Jinja templates (#376) (ea12dc5)
build warning on macOS (#377) (6405ee9)

Features

chat session response prefix (#375) (ea12dc5)
improve context shift strategy (#375) (ea12dc5)
use RAM and swap sizes in memory usage estimations (#375) (ea12dc5)
faster building from source (#375) (ea12dc5)
improve CPU compatibility score (#375) (ea12dc5)
inspect gguf command: print a single key flag (#375) (ea12dc5)

Shipped with llama.cpp release b3995

To use the latest llama.cpp release available, run npx -n node-llama-cpp source download --release latest. (learn more)

Assets 16

0 Join discussion

06 Oct 20:32

github-actions

v3.1.1

8145c94

v3.1.1

3.1.1 (2024-10-06)

Features

minor: reference common classes on the Llama instance (#360) (8145c94)

Shipped with llama.cpp release b3889

To use the latest llama.cpp release available, run npx -n node-llama-cpp source download --release latest. (learn more)

Assets 16

05 Oct 20:27

github-actions

v3.1.0

51eab61

v3.1.0

3.1.0 (2024-10-05)

Bug Fixes

improve metadata read times (#351) (4ee10a9)
hide internal type (#351) (4ee10a9)

Features

resolveModelFile method (#351) (4ee10a9)
hf: URI support (#351) (4ee10a9)

Shipped with llama.cpp release b3887

To use the latest llama.cpp release available, run npx -n node-llama-cpp source download --release latest. (learn more)

Assets 16

0 Join discussion

25 Sep 20:34

github-actions

v3.0.3

2e751c8

v3.0.3

✨ `node-llama-cpp` 3.0 is here! ✨

Read about the release in the blog post

3.0.3 (2024-09-25)

Bug Fixes

adapt to llama.cpp breaking change (#344) (2e751c8)

Shipped with llama.cpp release b3825

To use the latest llama.cpp release available, run npx -n node-llama-cpp source download --release latest. (learn more)

Assets 16

25 Sep 15:00

github-actions

v3.0.2

1291b97

v3.0.2

✨ `node-llama-cpp` 3.0 is here! ✨

Read about the release in the blog post

3.0.2 (2024-09-25)

Bug Fixes

node template: bug (#342) (1291b97)
use a compressed logo image for README.md (#340) (8ab983b)

Shipped with llama.cpp release b3821

To use the latest llama.cpp release available, run npx -n node-llama-cpp source download --release latest. (learn more)

Assets 2

24 Sep 04:11

github-actions

v3.0.1

ec45bbf

v3.0.1

✨ `node-llama-cpp` 3.0 is here! ✨

Read about the release in the blog post

3.0.1 (2024-09-24)

Bug Fixes

deploy docs website (#337) (ec45bbf)
release create-command package (#335) (51f4622)

Shipped with llama.cpp release b3808

To use the latest llama.cpp release available, run npx -n node-llama-cpp source download --release latest. (learn more)

Assets 16

24 Sep 01:38

github-actions

v3.0.0

97b0d86

v3.0.0

✨ `node-llama-cpp` 3.0 is here! ✨

Read about the release in the blog post

3.0.0 (2024-09-24)

Features

function calling (#139) (5fcdf9b)
get embedding for text (#144) (4cf1fba)
async model and context loading (#178) (315a3eb)
token biases (#196) (3ad4494)
automatic batching (#104) (4757af8)
prompt completion engine (#225) (95f4645)
model compatibility warnings (#225) (95f4645)
Vulkan support (#171) (d161bcd)
Windows on Arm prebuilt binary (#181) (f3b7f81)
change the default log level to warn (#191) (b542b53)
pull command (#214) (453c162)
inspect gpu command (#175) (5a70576)
inspect gguf command (#182) (35e6f50)
inspect estimate command (#309) (4b3ad61)
inspect measure command (#182) (35e6f50)
init command to scaffold a new project from a template (with node-typescript and electron-typescript-react templates) (#217) (d6a0f43)
move download, build and clear commands to be subcommands of a source command (#309) (4b3ad61)
move seed option to the prompt level (#309) (4b3ad61)
TemplateChatWrapper: custom history template for each message role (#309) (4b3ad61)
Llama 3.1 support (#273) (e3e0994)
Mistral chat wrapper (#309) (4b3ad61)
Functionary v3 support (#309) (4b3ad61)
Phi-3 support (#273) (e3e0994)
extract all prebuilt binaries to external modules (#309) (4b3ad61)
parallel function calling (#225) (95f4645)
preload prompt (#225) (95f4645)
onTextChunk option (#273) (e3e0994)
flash attention (#264) (c2e322c)
debug mode (#217) (d6a0f43)
load LoRA adapters (#217) (d6a0f43)
split gguf files support (#214) (453c162)
stopOnAbortSignal and customStopTriggers on LlamaChat and LlamaChatSession (#214) (453c162)
Llama 3 support (#205) (ef501f9)
--gpu flag in generation CLI commands (#205) (ef501f9)
specialTokens parameter on model.detokenize (#205) (ef501f9)
interactively select a model from CLI commands (#191) (b542b53)
automatically adapt to current free VRAM state (#182) (35e6f50)
GGUF file metadata info on LlamaModel (#182) (35e6f50)
use the tokenizer.chat_template header from the gguf file when available - use it to find a better specialized chat wrapper or use JinjaTemplateChatWrapper with it as a fallback (#182) (35e6f50)
simplify generation CLI commands: chat, complete, infill (#182) (35e6f50)
gguf parser (#168) (bcaab4f)
use the best compute layer available by default (#175) (5a70576)
more guardrails to prevent loading an incompatible prebuilt binary (#175) (5a70576)
completion and infill (#164) (ede69c1)
support configuring more options for getLlama when using "lastBuild" (#164) (ede69c1)
get VRAM state (#161) ([46235a2](https://github.com/withc...

Assets 2

0 Join discussion

Uh oh!

Releases: withcatai/node-llama-cpp

v3.3.2

3.3.2 (2024-12-27)

Bug Fixes

Uh oh!

v3.3.1

3.3.1 (2024-12-09)

Bug Fixes

Uh oh!

v3.3.0

3.3.0 (2024-12-02)

Bug Fixes

Features

Uh oh!

v3.2.0

3.2.0 (2024-10-31)

Bug Fixes

Features

Uh oh!

v3.1.1

3.1.1 (2024-10-06)

Features

Uh oh!

v3.1.0

3.1.0 (2024-10-05)

Bug Fixes

Features

Uh oh!

v3.0.3

✨ node-llama-cpp 3.0 is here! ✨

3.0.3 (2024-09-25)

Bug Fixes

Uh oh!

v3.0.2

✨ node-llama-cpp 3.0 is here! ✨

3.0.2 (2024-09-25)

Bug Fixes

Uh oh!

v3.0.1

✨ node-llama-cpp 3.0 is here! ✨

3.0.1 (2024-09-24)

Bug Fixes

Uh oh!

v3.0.0

✨ node-llama-cpp 3.0 is here! ✨

3.0.0 (2024-09-24)

Features

Uh oh!

✨ `node-llama-cpp` 3.0 is here! ✨

✨ `node-llama-cpp` 3.0 is here! ✨

✨ `node-llama-cpp` 3.0 is here! ✨

✨ `node-llama-cpp` 3.0 is here! ✨