server : update help metrics processing/deferred #11512

danbev · 2025-01-30T13:01:53Z

This commit updates the help text for the metrics requests_processing and requests_deferred to be more grammatically correct.

Currently the returned metrics look like this:

\# HELP llamacpp:requests_processing Number of request processing.
\# TYPE llamacpp:requests_processing gauge
llamacpp:requests_processing 0
\# HELP llamacpp:requests_deferred Number of request deferred.
\# TYPE llamacpp:requests_deferred gauge
llamacpp:requests_deferred 0

With this commit, the metrics will look like this:

\# HELP llamacpp:requests_processing Number of requests processing.
\# TYPE llamacpp:requests_processing gauge
llamacpp:requests_processing 0
\# HELP llamacpp:requests_deferred Number of requests deferred.
\# TYPE llamacpp:requests_deferred gauge
llamacpp:requests_deferred 0

This is also consistent with the description of the metrics in the server examples README.md.

This commit updates the help text for the metrics `requests_processing` and `requests_deferred` to be more grammatically correct. Currently the returned metrics look like this: ```console \# HELP llamacpp:requests_processing Number of request processing. \# TYPE llamacpp:requests_processing gauge llamacpp:requests_processing 0 \# HELP llamacpp:requests_deferred Number of request deferred. \# TYPE llamacpp:requests_deferred gauge llamacpp:requests_deferred 0 ``` With this commit, the metrics will look like this: ```console \# HELP llamacpp:requests_processing Number of requests processing. \# TYPE llamacpp:requests_processing gauge llamacpp:requests_processing 0 \# HELP llamacpp:requests_deferred Number of requests deferred. \# TYPE llamacpp:requests_deferred gauge llamacpp:requests_deferred 0 ``` This is also consistent with the description of the metrics in the server examples [README.md](https://github.com/ggerganov/llama.cpp/tree/master/examples/server#get-metrics-prometheus-compatible-metrics-exporter).

danbev requested a review from ngxson as a code owner January 30, 2025 13:01

github-actions bot added examples server labels Jan 30, 2025

ngxson approved these changes Jan 30, 2025

View reviewed changes

danbev merged commit a2df278 into ggml-org:master Jan 31, 2025
45 checks passed

danbev deleted the server-metrics-requests branch January 31, 2025 08:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

server : update help metrics processing/deferred #11512

server : update help metrics processing/deferred #11512

Uh oh!

danbev commented Jan 30, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

server : update help metrics processing/deferred #11512

server : update help metrics processing/deferred #11512

Uh oh!

Conversation

danbev commented Jan 30, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants