Commit b94412b

Update README.md
1 parent 1583908 commit b94412b

1 file changed: +2 -2 lines changed

README.md

Lines changed: 2 additions & 2 deletions
@@ -111,13 +111,13 @@ cmake --build build -j"$(nproc)"
 ./build/bin/llama-bench \
 --amx \
 -m /path-to-your-model.gguf \
--t 32 -ngl 10 -b 256 -ub 256 -pg 1024 --no-warmup
+-t 32 -ngl 10 -nopo 1 -b 512 -ub 512 -pg 512,512 --repetitions 3

 # CLI (hybrid) quick generation
 ./build/bin/llama-cli \
 --amx \
 -m /path-to-your-model.gguf \
--t 32 -ngl 10 -c 4096 -n 64 -p "10 facts about birds" --no-warmup
+-ngl 10 -t 32 -b 4096 -c 4096 -n 512 -p "10 facts about birds" -no-cnv

 # Server (hybrid) – default port 8080
 ./build/bin/llama-server --amx \
