Skip to content

v3.14.0

Latest
Compare
Choose a tag to compare
@github-actions github-actions released this 02 Oct 21:53
02805ee

3.14.0 (2025-10-02)

Features

  • Qwen3 Reranker support (#506) (00305f7) (see #506 for prequantized Qwen3 Reranker models you can use)

Bug Fixes

  • handle HuggingFace rate limit responses (#506) (00305f7)
  • adapt to llama.cpp breaking changes (#506) (00305f7)

Shipped with llama.cpp release b6673

To use the latest llama.cpp release available, run npx -n node-llama-cpp source download --release latest. (learn more)