We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent ca71fb9 commit c5fef0fCopy full SHA for c5fef0f
tools/server/README.md
@@ -1045,6 +1045,7 @@ Available metrics:
1045
- `llamacpp:kv_cache_tokens`: KV-cache tokens.
1046
- `llamacpp:requests_processing`: Number of requests processing.
1047
- `llamacpp:requests_deferred`: Number of requests deferred.
1048
+- `llamacpp:n_past_max`: High watermark of the context size observed.
1049
1050
### POST `/slots/{id_slot}?action=save`: Save the prompt cache of the specified slot to a file.
1051
0 commit comments