Skip to content

Conversation

@irar2
Copy link
Collaborator

@irar2 irar2 commented Sep 9, 2025

Closes #179

This PR implements the setting of KV cache usage metric.
In addition, in moves KV cache insertion to the actual request execution, and the deletion to the end of the execution.

@irar2 irar2 requested a review from mayabar September 9, 2025 06:03
Signed-off-by: Ira <[email protected]>
Signed-off-by: Ira <[email protected]>
@mayabar
Copy link
Collaborator

mayabar commented Sep 9, 2025

/lgtm
/approve

@github-actions github-actions bot added the lgtm label Sep 9, 2025
@mayabar mayabar merged commit 40ec02c into llm-d:main Sep 9, 2025
4 checks passed
@irar2 irar2 deleted the kvmetrics branch September 9, 2025 07:54
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Expose Prometheus metrics related to KV Cache

2 participants