Actions: ggml-org/llama.cpp
Actions
6,732 workflow run results
6,732 workflow run results
server: add --reasoning-budget 0 to disable thinking (incl. qwen3 w/ enable_thinking:false)
CI
#22915:
Pull request #13771
synchronize
by
ochafik
server: add --reasoning-budget 0 to disable thinking (incl. qwen3 w/ enable_thinking:false)
CI
#22911:
Pull request #13771
synchronize
by
ochafik
server: add --reasoning-budget 0 to disable thinking (incl. qwen3 w/ enable_thinking:false)
CI
#22909:
Pull request #13771
synchronize
by
ochafik
server: add --reasoning-budget 0 to disable thinking (incl. qwen3 w/ enable_thinking:false)
CI
#22908:
Pull request #13771
synchronize
by
ochafik
server: add --reasoning-budget 0 to disable thinking (incl. qwen3 w/ enable_thinking:false)
CI
#22901:
Pull request #13771
synchronize
by
ochafik
server: add --reasoning-budget 0 to disable thinking (incl. qwen3 w/ enable_thinking:false)
CI
#22900:
Pull request #13771
synchronize
by
ochafik
server: add --reasoning-budget 0 to disable thinking (incl. qwen3 w/ enable_thinking:false)
CI
#22899:
Pull request #13771
opened
by
ochafik
server: streaming of tool calls and thoughts when --jinja is on (…
CI
#22892:
Commit f5cd27b
pushed
by
ochafik