
Commit c5fef0f

server: update readme to mention n_past_max metric (#16436)
#15361 added a new exported metric, but I missed updating this doc.
1 parent ca71fb9 commit c5fef0f

1 file changed: +1 -0 lines changed

tools/server/README.md

Lines changed: 1 addition & 0 deletions
@@ -1045,6 +1045,7 @@ Available metrics:
 - `llamacpp:kv_cache_tokens`: KV-cache tokens.
 - `llamacpp:requests_processing`: Number of requests processing.
 - `llamacpp:requests_deferred`: Number of requests deferred.
+- `llamacpp:n_past_max`: High watermark of the context size observed.
 
 ### POST `/slots/{id_slot}?action=save`: Save the prompt cache of the specified slot to a file.
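
For readers who want to consume the new metric, the following is a minimal sketch of reading `llamacpp:n_past_max` from a metrics dump. It assumes the server exposes these metrics in the Prometheus text exposition format. The helper `parse_metrics` and the sample values are hypothetical and for illustration only; they are not part of the server or this commit.

```python
def parse_metrics(text: str) -> dict[str, float]:
    """Parse unlabelled samples from Prometheus-style text output.

    Hypothetical helper: it handles only `name value [timestamp]` lines
    and skips comment lines such as `# HELP` and `# TYPE`.
    """
    samples: dict[str, float] = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and metadata comments
        name, _, rest = line.partition(" ")
        # The first field after the name is the value; a timestamp may follow.
        samples[name] = float(rest.split()[0])
    return samples


# Illustrative payload; the numbers are made up, not real server output.
SAMPLE = """\
# HELP llamacpp:n_past_max High watermark of the context size observed.
llamacpp:n_past_max 4096
llamacpp:requests_deferred 0
"""

metrics = parse_metrics(SAMPLE)
print(metrics["llamacpp:n_past_max"])  # prints 4096.0
```

Because the metric is described as a high watermark, a reader would expect the reported value to stay at the largest context size seen so far rather than track the current one.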
