Commit a0dd85c

readme : minor simplifications of the server commands
1 parent dfb0fb9 commit a0dd85c

1 file changed: +3 -3 lines changed


README.md

Lines changed: 3 additions & 3 deletions
@@ -102,7 +102,7 @@ Here are recommended settings, depending on the amount of VRAM that you have:
 ```bash
 llama-server \
     -hf ggml-org/Qwen2.5-Coder-7B-Q8_0-GGUF \
-    --port 8012 -ngl 99 -fa -ub 1024 -b 1024 -dt 0.1 \
+    --port 8012 -ngl 99 -fa -ub 1024 -b 1024 \
     --ctx-size 0 --cache-reuse 256
 ```

@@ -111,7 +111,7 @@ Here are recommended settings, depending on the amount of VRAM that you have:
 ```bash
 llama-server \
     -hf ggml-org/Qwen2.5-Coder-3B-Q8_0-GGUF \
-    --port 8012 -ngl 99 -fa -ub 1024 -b 1024 -dt 0.1 \
+    --port 8012 -ngl 99 -fa -ub 1024 -b 1024 \
     --ctx-size 0 --cache-reuse 256
 ```

@@ -120,7 +120,7 @@ Here are recommended settings, depending on the amount of VRAM that you have:
 ```bash
 llama-server \
     -hf ggml-org/Qwen2.5-Coder-1.5B-Q8_0-GGUF \
-    --port 8012 -ngl 99 -fa -ub 1024 -b 1024 -dt 0.1 \
+    --port 8012 -ngl 99 -fa -ub 1024 -b 1024 \
     --ctx-size 0 --cache-reuse 256
 ```
