Commit a2df278

server : update help metrics processing/deferred (ggml-org#11512)
This commit updates the help text for the metrics `requests_processing` and `requests_deferred` to be more grammatically correct.

Currently the returned metrics look like this:
```console
# HELP llamacpp:requests_processing Number of request processing.
# TYPE llamacpp:requests_processing gauge
llamacpp:requests_processing 0
# HELP llamacpp:requests_deferred Number of request deferred.
# TYPE llamacpp:requests_deferred gauge
llamacpp:requests_deferred 0
```

With this commit, the metrics will look like this:
```console
# HELP llamacpp:requests_processing Number of requests processing.
# TYPE llamacpp:requests_processing gauge
llamacpp:requests_processing 0
# HELP llamacpp:requests_deferred Number of requests deferred.
# TYPE llamacpp:requests_deferred gauge
llamacpp:requests_deferred 0
```

This is also consistent with the description of the metrics in the server examples [README.md](https://github.com/ggerganov/llama.cpp/tree/master/examples/server#get-metrics-prometheus-compatible-metrics-exporter).
1 parent 553f1e4 commit a2df278

1 file changed, with 2 additions and 2 deletions

examples/server/server.cpp

Lines changed: 2 additions & 2 deletions
@@ -3633,11 +3633,11 @@ int main(int argc, char ** argv) {
                 {"value", (uint64_t) res_metrics->kv_cache_tokens_count}
         },{
                 {"name", "requests_processing"},
-                {"help", "Number of request processing."},
+                {"help", "Number of requests processing."},
                 {"value", (uint64_t) res_metrics->n_processing_slots}
         },{
                 {"name", "requests_deferred"},
-                {"help", "Number of request deferred."},
+                {"help", "Number of requests deferred."},
                 {"value", (uint64_t) res_metrics->n_tasks_deferred}
         }}}
     };
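
To illustrate how a `help` string like the ones changed here ends up in the `# HELP` line of the metrics output, here is a minimal, self-contained sketch. It is not the server's actual code: the `MetricDef` struct and `render_gauges` function are hypothetical names introduced for illustration, and the server itself stores these entries as JSON objects. The sketch only assumes the `llamacpp:` prefix and the three-line gauge layout shown in the commit message.

```cpp
#include <cstdint>
#include <sstream>
#include <string>
#include <vector>

// Hypothetical stand-in for one metric definition (name, help text, value).
// The real server keeps these as JSON objects, not as a struct.
struct MetricDef {
    std::string   name;
    std::string   help;
    std::uint64_t value;
};

// Render each definition as three lines of Prometheus text exposition
// format: a HELP line, a TYPE line, and the sample itself.
std::string render_gauges(const std::vector<MetricDef> & defs) {
    std::ostringstream out;
    for (const auto & m : defs) {
        const std::string full_name = "llamacpp:" + m.name;
        out << "# HELP " << full_name << " " << m.help << "\n"
            << "# TYPE " << full_name << " gauge\n"
            << full_name << " " << m.value << "\n";
    }
    return out.str();
}
```

Because the help text is copied verbatim into the `# HELP` line, a wording fix in the definition changes the scraped output directly, while the metric name, type, and value stay the same.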
