Commit 43ccf81

server : fix tests to pass --no-slots when necessary
1 parent a14483f commit 43ccf81

2 files changed: +3 -1 lines changed

tools/server/README.md

Lines changed: 1 addition & 1 deletion
@@ -185,8 +185,8 @@ The project is under active development, and we are [looking for feedback and co
 | `--threads-http N` | number of threads used to process HTTP requests (default: -1)<br/>(env: LLAMA_ARG_THREADS_HTTP) |
 | `--cache-reuse N` | min chunk size to attempt reusing from the cache via KV shifting (default: 0)<br/>[(card)](https://ggml.ai/f0.png)<br/>(env: LLAMA_ARG_CACHE_REUSE) |
 | `--metrics` | enable prometheus compatible metrics endpoint (default: disabled)<br/>(env: LLAMA_ARG_ENDPOINT_METRICS) |
-| `--slots` | enable slots monitoring endpoint (default: disabled)<br/>(env: LLAMA_ARG_ENDPOINT_SLOTS) |
 | `--props` | enable changing global properties via POST /props (default: disabled)<br/>(env: LLAMA_ARG_ENDPOINT_PROPS) |
+| `--slots` | enable slots monitoring endpoint (default: enabled)<br/>(env: LLAMA_ARG_ENDPOINT_SLOTS) |
 | `--no-slots` | disables slots monitoring endpoint<br/>(env: LLAMA_ARG_NO_ENDPOINT_SLOTS) |
 | `--slot-save-path PATH` | path to save slot kv cache (default: disabled) |
 | `--jinja` | use jinja template for chat (default: disabled)<br/>(env: LLAMA_ARG_JINJA) |

tools/server/tests/utils.py

Lines changed: 2 additions & 0 deletions
@@ -148,6 +148,8 @@ def start(self, timeout_seconds: int | None = DEFAULT_HTTP_TIMEOUT) -> None:
             server_args.append("--metrics")
         if self.server_slots:
             server_args.append("--slots")
+        else:
+            server_args.append("--no-slots")
         if self.pooling:
             server_args.extend(["--pooling", self.pooling])
         if self.model_alias:
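
Why the `else` branch matters: the README hunk documents the slots endpoint as enabled by default, so simply omitting `--slots` presumably no longer turns it off, and a test that expects the endpoint to be disabled has to say so explicitly. A minimal standalone sketch of that flag selection follows; the helper name `slots_args` is hypothetical and is not part of `utils.py`.

```python
def slots_args(server_slots: bool) -> list[str]:
    """Return the CLI flag that states the slots-endpoint setting explicitly.

    Hypothetical helper for illustration; the real test harness appends
    the flag inline inside its start() method.
    """
    # Assumption: the server enables the endpoint by default, so a False
    # setting must map to an explicit opt-out rather than to no flag at all.
    return ["--slots"] if server_slots else ["--no-slots"]


# Build an argument list the way the test harness does, with slots disabled.
server_args = ["--metrics"]
server_args.extend(slots_args(False))
print(server_args)  # ['--metrics', '--no-slots']
```

Passing the flag in both cases keeps the tests independent of whichever default the server binary ships with.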
