Skip to content
igardev edited this page Jan 5, 2026 · 31 revisions

Version 0.0.40 is released (05.01.2025)

What is new

Generation of multiple completions in parallel:

  • Setting max_parallel_completions determines how many completions to generate in parallel (default 3)
  • Shortcuts - Alt+] - next completion, Alt+[ - previous completion
  • Requires llama.cpp after December, 6, 2025 (commit c42712b) but is backword compatible (generates one completion for older versions)
  • More details

Setup instructions for llama.cpp server

More details about llama.cpp server

Features

Clone this wiki locally