@giladgd (Member) commented Oct 27, 2024

Description of change

  • feat: chat session response prefix (see the usage sketch below)
  • feat: improve context shift strategy
  • feat: use RAM and swap sizes in memory usage estimations
  • feat(inspect gguf command): print a single key flag
  • feat: faster building from source
  • fix: Electron crash with some models on macOS when not using Metal (via fix: use vm_allocate to allocate CPU backend buffer on macOS ggml-org/llama.cpp#9875)
  • fix: adapt to llama.cpp breaking changes
  • fix: improve CPU compatibility score
  • docs: mention the need to treat node-llama-cpp as an external module in Electron
  • docs: sitemap fixes
  • docs(README): make the logo be a link
  • chore: update issue config

Fixes #361
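
As a rough illustration of the response-prefix item above, here is a minimal usage sketch, assuming the feature is exposed as a `responsePrefix` option on `LlamaChatSession.prompt()`; the model path and prefix text are placeholders:

```ts
import {fileURLToPath} from "url";
import path from "path";
import {getLlama, LlamaChatSession} from "node-llama-cpp";

const __dirname = path.dirname(fileURLToPath(import.meta.url));

const llama = await getLlama();

// placeholder path - point this at any local GGUF model file
const model = await llama.loadModel({
    modelPath: path.join(__dirname, "models", "Meta-Llama-3.1-8B-Instruct.Q4_K_M.gguf")
});
const context = await model.createContext();
const session = new LlamaChatSession({
    contextSequence: context.getSequence()
});

// assumption: the response prefix forces the start of the model's answer,
// so the generated text continues from the given prefix
const answer = await session.prompt("List three prime numbers", {
    responsePrefix: "Here are three prime numbers:\n"
});
console.log(answer);
```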

Pull-Request Checklist

  • Code is up-to-date with the master branch
  • npm run format to apply eslint formatting
  • npm run test passes with this change
  • This pull request links relevant issues as Fixes #0000
  • There are new or updated unit tests validating the change
  • Documentation has been updated to reflect this change
  • The new commits and pull request title follow conventions explained in pull request guidelines (PRs that do not follow this convention will not be merged)

@giladgd giladgd requested a review from ido-pluto October 27, 2024 00:04
@giladgd giladgd self-assigned this Oct 27, 2024
@ido-pluto (Contributor) left a comment

LGTM

@giladgd giladgd merged commit ea12dc5 into master Oct 29, 2024
14 checks passed
@giladgd giladgd deleted the gilad/chatResponsePrefix branch October 29, 2024 23:54
@github-actions

🎉 This PR is included in version 3.2.0 🎉

The release is available on the npm package and the GitHub release page.

Your semantic-release bot 📦🚀

Successfully merging this pull request may close these issues.

Electron sample app crash on Mac with specific model Meta-Llama-3.1-8B-Instruct.Q4_K_M.gguf