v0.3.16-cu128-AVX2-linux-20251031
·
77 commits
to main
since this release
New Update: Support for Qwen3VL GGUF
feat: Update README.md for Qwen3VL example(Thinking/No Thinking)
feat: feat: Add Qwen3VLChatHandler into llama_chat_format.py
feat: Update llama.cpp api 20251031
update: Update Submodule vendor/llama.cpp 16724b5..8da3c0e