We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent 9818033 commit 6f601f4Copy full SHA for 6f601f4
docs/user_guide/metrics.md
@@ -378,3 +378,9 @@ Further documentation can be found in the `TRITONSERVER_MetricFamily*` and
378
The TRT-LLM backend uses the custom metrics API to track and expose specific metrics about
379
LLMs, KV Cache, and Inflight Batching to Triton:
380
https://github.com/triton-inference-server/tensorrtllm_backend?tab=readme-ov-file#triton-metrics
381
+
382
+### vLLM Backend Metrics
383
384
+The vLLM backend uses the custom metrics API to track and expose specific metrics about
385
+LLMs to Triton:
386
+https://github.com/triton-inference-server/vllm_backend?tab=readme-ov-file#triton-metrics
0 commit comments