v0.3.16-cu128-AVX2-win-20251108
feat: Update Submodule vendor/llama.cpp 48bd265..299f5d7
feat: Update the llama.cpp API and extend the State/sessions API
feat: Better Qwen3VL chat template. (Thanks to @alcoftTAO)
Note: llama_chat_template now allows more flexible input of the parameters a model requires and supports applying more complex Jinja templates.
The constructor parameters for Qwen3VLChatHandler have changed: "use_think_prompt" has been renamed to "force_reasoning".
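A minimal migration sketch for the renamed parameter, assuming Qwen3VLChatHandler is exported from llama_cpp.llama_chat_format like the other chat handlers; the remaining constructor arguments (clip_model_path, model paths, n_ctx) are illustrative placeholders, not the exact signature:

```python
from llama_cpp import Llama
from llama_cpp.llama_chat_format import Qwen3VLChatHandler  # assumed module path

# Before this release (no longer valid):
# handler = Qwen3VLChatHandler(clip_model_path="mmproj-qwen3vl.gguf", use_think_prompt=True)

# From this release on, pass force_reasoning instead of use_think_prompt:
handler = Qwen3VLChatHandler(
    clip_model_path="mmproj-qwen3vl.gguf",  # multimodal projector path (placeholder)
    force_reasoning=True,                   # renamed from use_think_prompt
)

llm = Llama(
    model_path="Qwen3-VL.gguf",  # placeholder model path
    chat_handler=handler,
    n_ctx=8192,
)
```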