v0.3.16-cu128-AVX2-linux-20251022

Latest

Latest

github-actions released this 21 Oct 21:47

· 3 commits to main since this release

v0.3.16-cu128-AVX2-linux-20251021

e7d36e4

feat: Update Submodule vendor/llama.cpp df1b612..03792ad
feat: Update some llama model parameters(check_tensors, use_extra_bufts, no_host)
feat: Sync model : Granite docling + Idefics3 preprocessing (SmolVLM)
feat: Sync server : context checkpointing for hybrid and recurrent models
feat: Sync llama: print memory breakdown on exit
feat: Synchronize some enum variable values
feat: Introducing index numbers to avoid the hallucination problem of multiple images entering the minicpm multimodal model series as much as possible

Assets 6