We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent 3edee95 commit a23b572Copy full SHA for a23b572
ai/vllm-deployment/hpa/README.md
@@ -24,8 +24,8 @@ The autoscaling solution works as follows:
24
flowchart TD
25
D("PrometheusRule (GPU Metric Only)")
26
B("Prometheus Server")
27
+ C("ServiceMonitor")
28
subgraph subGraph0["Metrics Collection"]
- C("ServiceMonitor")
29
A["vLLM Server"]
30
H["GPU DCGM Exporter"]
31
end
@@ -34,9 +34,9 @@ flowchart TD
34
F("API Server (Custom Metrics)")
35
G("HPA Controller")
36
37
- A -- Scrapes Raw Metrics --> C
38
- H -- Scrapes Raw Metrics --> C
39
- C -- Configures Scrape --> B
+ B -- Scrapes Raw Metrics --> A
+ B -- Scrapes Raw Metrics --> H
+ C -- Configures Scrape <--> B
40
B -- Processes Raw Metrics via --> D
41
D -- Creates Clean Metric in --> B
42
F -- Custom Metrics API <--> E
0 commit comments