Skip to content

Commit 7ab6685

Browse files
committed
server : update help metrics processing/deferred
This commit updates the help text for the metrics `requests_processing` and `requests_deferred` to be more grammatically correct. Currently the returned metrics look like this: ```console \# HELP llamacpp:requests_processing Number of request processing. \# TYPE llamacpp:requests_processing gauge llamacpp:requests_processing 0 \# HELP llamacpp:requests_deferred Number of request deferred. \# TYPE llamacpp:requests_deferred gauge llamacpp:requests_deferred 0 ``` With this commit, the metrics will look like this: ```console \# HELP llamacpp:requests_processing Number of requests processing. \# TYPE llamacpp:requests_processing gauge llamacpp:requests_processing 0 \# HELP llamacpp:requests_deferred Number of requests deferred. \# TYPE llamacpp:requests_deferred gauge llamacpp:requests_deferred 0 ``` This is also consistent with the description of the metrics in the server examples [README.md](https://github.com/ggerganov/llama.cpp/tree/master/examples/server#get-metrics-prometheus-compatible-metrics-exporter).
1 parent 4314e56 commit 7ab6685

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

examples/server/server.cpp

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -3575,11 +3575,11 @@ int main(int argc, char ** argv) {
35753575
{"value", (uint64_t) res_metrics->kv_cache_tokens_count}
35763576
},{
35773577
{"name", "requests_processing"},
3578-
{"help", "Number of request processing."},
3578+
{"help", "Number of requests processing."},
35793579
{"value", (uint64_t) res_metrics->n_processing_slots}
35803580
},{
35813581
{"name", "requests_deferred"},
3582-
{"help", "Number of request deferred."},
3582+
{"help", "Number of requests deferred."},
35833583
{"value", (uint64_t) res_metrics->n_tasks_deferred}
35843584
}}}
35853585
};

0 commit comments

Comments
 (0)