Commit 4540100

addressing coderabbit feedback
1 parent 6357033 commit 4540100

1 file changed: +3 -3 lines changed

modules/configuring-metric-based-autoscaling.adoc

Lines changed: 3 additions & 3 deletions
@@ -53,9 +53,9 @@ spec:
 name: inference-prometheus-auth
 authModes:
 - bearer
-target:
-type: Value
-value: 2
+target:
+type: Value
+value: 2
 ----
 +
 The example configuration sets up the inference service to autoscale between 1 and 5 replicas based on the number of requests waiting to be processed, as indicated by the `vllm:num_requests_waiting` metric.
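
To make the scaling behavior described above concrete, here is a minimal sketch of how a replica count could be derived from the metric. It assumes the autoscaler applies the standard Kubernetes Horizontal Pod Autoscaler formula for a `Value` target, which the commit itself does not confirm. It also ignores tolerance and stabilization windows. The function name `desired_replicas` is hypothetical; the bounds (1 and 5) and the target value (2) come from the example configuration.

```python
import math


def desired_replicas(current_replicas, metric_value,
                     target_value=2.0, min_replicas=1, max_replicas=5):
    """Simplified HPA-style estimate for a `Value` target.

    Assumption: desired = ceil(current_replicas * metric_value / target_value),
    clamped to [min_replicas, max_replicas]. Real autoscalers also apply a
    tolerance band and stabilization windows, which are omitted here.
    """
    if current_replicas <= 0:
        # Nothing is running yet; fall back to the configured minimum.
        return min_replicas
    raw = math.ceil(current_replicas * metric_value / target_value)
    return max(min_replicas, min(max_replicas, raw))


if __name__ == "__main__":
    # metric_value stands in for a sample of vllm:num_requests_waiting.
    for replicas, waiting in [(1, 6), (2, 10), (3, 0)]:
        print(replicas, waiting, desired_replicas(replicas, waiting))
```

Under these assumptions, a single replica with 6 waiting requests would scale toward 3 replicas, and the result never leaves the 1 to 5 range.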
