Releases: withcatai/node-llama-cpp
v3.0.0-beta.45
3.0.0-beta.45 (2024-09-19)
Bug Fixes
- improve performance of parallel evaluation from multiple contexts (#309) (4b3ad61)
- Llama 3.1 chat wrapper standard chat history (#309) (4b3ad61)
- adapt to `llama.cpp` sampling refactor (#309) (4b3ad61)
- Llama 3 Instruct function calling (#309) (4b3ad61)
- don't preload prompt in the `chat` command when using `--printTimings` or `--meter` (#309) (4b3ad61)
- more stable Jinja template matching (#309) (4b3ad61)
Features
- `inspect estimate` command (#309) (4b3ad61)
- move `seed` option to the prompt level (#309) (4b3ad61) (see the sketch after this list)
- Functionary v3 support (#309) (4b3ad61)
- Mistral chat wrapper (#309) (4b3ad61)
- improve Llama 3.1 chat template detection (#309) (4b3ad61)
- change `autoDisposeSequence` default to `false` (#309) (4b3ad61)
- move `download`, `build` and `clear` commands to be subcommands of a `source` command (#309) (4b3ad61)
- simplify `TokenBias` (#309) (4b3ad61)
- better `threads` default value (#309) (4b3ad61)
- make `LlamaEmbedding` an object (#309) (4b3ad61) (see the sketch after the llama.cpp release note below)
- `HF_TOKEN` env var support for reading GGUF file metadata (#309) (4b3ad61)
- `TemplateChatWrapper`: custom history template for each message role (#309) (4b3ad61)
- more helpful `inspect gpu` command (#309) (4b3ad61)
- all tokenizer tokens iterator (#309) (4b3ad61)
- failed context creation automatic remedy (#309) (4b3ad61)
- abort generation support in CLI commands (#309) (4b3ad61)
- `--gpuLayers max` and `--contextSize max` flag support for `inspect estimate` command (#309) (4b3ad61)
- extract all prebuilt binaries to external modules (#309) (4b3ad61)
- updated docs (#309) (4b3ad61)
- combine model downloaders (#309) (4b3ad61)
- feat(electron example template): update badge, scroll anchoring, table support (#309) (4b3ad61)
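
The prompt-level `seed` option mentioned above can be set per call instead of per model or context. A minimal sketch of how that might look with this beta, assuming a local GGUF model path and that `prompt()` accepts a `seed` option:

```typescript
import {getLlama, LlamaChatSession} from "node-llama-cpp";

const llama = await getLlama();
const model = await llama.loadModel({
    // hypothetical path; point this at a local GGUF model file
    modelPath: "models/model.gguf"
});
const context = await model.createContext();
const session = new LlamaChatSession({
    contextSequence: context.getSequence()
});

// The seed is now passed per prompt rather than at the model/context level
// (per the "move seed option to the prompt level" item above)
const answer = await session.prompt("Hi there, how are you?", {
    seed: 1234
});
console.log(answer);
```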
Shipped with `llama.cpp` release `b3785`
To use the latest `llama.cpp` release available, run `npx -n node-llama-cpp source download --release latest`. (learn more)
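
Regarding the "make `LlamaEmbedding` an object" item in the feature list above, here is a minimal sketch of what reading an embedding might look like, assuming `createEmbeddingContext()` and `getEmbeddingFor()` are available and that the returned `LlamaEmbedding` object exposes its values on a `vector` property:

```typescript
import {getLlama} from "node-llama-cpp";

const llama = await getLlama();
const model = await llama.loadModel({
    // hypothetical path; point this at a local GGUF model file
    modelPath: "models/model.gguf"
});

// Embeddings are created through a dedicated embedding context
const embeddingContext = await model.createEmbeddingContext();
const embedding = await embeddingContext.getEmbeddingFor("Hello world");

// With LlamaEmbedding being an object, the raw values are assumed to be
// exposed on a `vector` property instead of the result being a plain array
console.log(embedding.vector.length);
```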
v2.8.16
v3.0.0-beta.44
3.0.0-beta.44 (2024-08-10)
Bug Fixes
Shipped with `llama.cpp` release `b3543`
To use the latest `llama.cpp` release available, run `npx --no node-llama-cpp download --release latest`. (learn more)
v3.0.0-beta.43
3.0.0-beta.43 (2024-08-09)
Bug Fixes
Shipped with `llama.cpp` release `b3560`
To use the latest `llama.cpp` release available, run `npx --no node-llama-cpp download --release latest`. (learn more)
v3.0.0-beta.42
3.0.0-beta.42 (2024-08-07)
Bug Fixes
Shipped with `llama.cpp` release `b3541`
To use the latest `llama.cpp` release available, run `npx --no node-llama-cpp download --release latest`. (learn more)
v2.8.15
v3.0.0-beta.41
3.0.0-beta.41 (2024-08-02)
Bug Fixes
Shipped with `llama.cpp` release `b3504`
To use the latest `llama.cpp` release available, run `npx --no node-llama-cpp download --release latest`. (learn more)
v3.0.0-beta.40
3.0.0-beta.40 (2024-07-30)
Bug Fixes
Features
Shipped with `llama.cpp` release `b3488`
To use the latest `llama.cpp` release available, run `npx --no node-llama-cpp download --release latest`. (learn more)
v3.0.0-beta.39
3.0.0-beta.39 (2024-07-28)
Bug Fixes
- Gemma chat wrapper bug (#273) (e3e0994)
- GGUF metadata nested key conflicts (#273) (e3e0994)
- adapt to `llama.cpp` breaking changes (#273) (e3e0994)
- preserve function calling chunks (#273) (e3e0994)
- format JSON objects like models expect (#273) (e3e0994)
Features
- Llama 3.1 support (#273) (e3e0994)
- Phi-3 support (#273) (e3e0994)
- model metadata overrides (#273) (e3e0994)
- use LoRA on a context instead of on a model (#273) (e3e0994)
- `onTextChunk` option (#273) (e3e0994) (see the sketch below)
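
A minimal sketch of the new `onTextChunk` option, assuming it is accepted by `prompt()` and receives each generated text chunk as a string:

```typescript
import {getLlama, LlamaChatSession} from "node-llama-cpp";

const llama = await getLlama();
const model = await llama.loadModel({
    // hypothetical path; point this at a local GGUF model file
    modelPath: "models/model.gguf"
});
const context = await model.createContext();
const session = new LlamaChatSession({
    contextSequence: context.getSequence()
});

// `onTextChunk` is assumed to be called with each newly generated piece of
// text, which makes it easy to stream the response to the terminal
const answer = await session.prompt("Write a haiku about llamas", {
    onTextChunk(chunk: string) {
        process.stdout.write(chunk);
    }
});
console.log("\n---\n" + answer);
```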
Shipped with `llama.cpp` release `b3479`
To use the latest `llama.cpp` release available, run `npx --no node-llama-cpp download --release latest`. (learn more)