Skip to content

Conversation

@giladgd
Copy link
Member

@giladgd giladgd commented Jan 5, 2026

Description of change

  • feat(LlamaCompletion): stopOnAbortSignal
  • feat(LlamaModel): useDirectIo
  • fix: support new CUDA 13.1 archs
  • fix: build the prebuilt binaries with CUDA 13.1 instead of 13.0
  • docs: stopping a text completion generation

Fixes #539

Pull-Request Checklist

  • Code is up-to-date with the master branch
  • npm run format to apply eslint formatting
  • npm run test passes with this change
  • This pull request links relevant issues as Fixes #0000
  • There are new or updated unit tests validating the change
  • Documentation has been updated to reflect this change
  • The new commits and pull request title follow conventions explained in pull request guidelines (PRs that do not follow this convention will not be merged)

@giladgd giladgd requested a review from ido-pluto January 5, 2026 11:47
@giladgd giladgd self-assigned this Jan 5, 2026
@giladgd giladgd changed the title fix: support new CUDA 13.1 archs feat(LlamaCompletion): stopOnAbortSignal Jan 7, 2026
@giladgd giladgd merged commit 734693d into master Jan 10, 2026
20 checks passed
@giladgd giladgd deleted the gilad/cuda13.1 branch January 10, 2026 18:59
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

docs: No information how to stop the inference

3 participants