Skip to content

Conversation

@zoq
Copy link

@zoq zoq commented Oct 21, 2025

Added the bench parameter that allows us to run the llama-cli multiple times, also extended the metrics we report at the end.

./build/bin/llama-cli -m ./Qwen3_0.6B.Q8_0.gguf -ngl 999 -c 2048 -s 42 --temp 0 --top-p 1.0 --top-k 0 --flash-attn off -st -p "Tell me a joke about cats" --bench 5

@olyasir olyasir merged commit a438af4 into tetherto:temp-latest-finetuning Oct 24, 2025
41 of 47 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants