3.14.0 (2025-10-02)
Features
- Qwen3 Reranker support (#506) (00305f7) (see #506 for prequantized Qwen3 Reranker models you can use)
Bug Fixes
- handle HuggingFace rate limit responses (#506) (00305f7)
- adapt to
llama.cpp
breaking changes (#506) (00305f7)
Shipped with llama.cpp
release b6673
To use the latest
llama.cpp
release available, runnpx -n node-llama-cpp source download --release latest
. (learn more)