File tree Expand file tree Collapse file tree 1 file changed +59
-0
lines changed Expand file tree Collapse file tree 1 file changed +59
-0
lines changed Original file line number Diff line number Diff line change @@ -158,4 +158,63 @@ spec:
158158{{< /tab >}}
159159{{< /tabs >}}
160160
161+ # # Scale Down Delay
162+
163+ Scale Down Delay specifies a time window which must pass at reduced concurrency
164+ before a scale-down decision is applied. This can be useful, for example, to
165+ keep containers around for a configurable duration to avoid a cold start
166+ penalty if new requests come in. Unlike setting a lower bound, the revision
167+ will eventually be scaled down if reduced concurrency is maintained for the
168+ delay period.
169+
170+ * **Global key:** `scale-down-delay`
171+ * **Per-revision annotation key:** `autoscaling.knative.dev/scaleDownDelay`
172+ * **Possible values:** Duration, `0s` <= value <= `1h`
173+ * **Default:** `0s` (no delay)
174+
175+ **Example:**
176+ {{< tabs name="scale-down-delay" default="Per Revision" >}}
177+ {{% tab name="Per Revision" %}}
178+ ` ` ` yaml
179+ apiVersion: serving.knative.dev/v1
180+ kind: Service
181+ metadata:
182+ name: helloworld-go
183+ namespace: default
184+ spec:
185+ template:
186+ metadata:
187+ annotations:
188+ autoscaling.knative.dev/scaleDownDelay: "15m"
189+ spec:
190+ containers:
191+ - image: gcr.io/knative-samples/helloworld-go
192+ ` ` `
193+ {{< /tab >}}
194+ {{% tab name="Global (ConfigMap)" %}}
195+ ` ` ` yaml
196+ apiVersion: v1
197+ kind: ConfigMap
198+ metadata:
199+ name: config-autoscaler
200+ namespace: knative-serving
201+ data:
202+ scale-down-delay: "15m"
203+ ` ` `
204+ {{< /tab >}}
205+ {{% tab name="Global (Operator)" %}}
206+ ` ` ` yaml
207+ apiVersion: operator.knative.dev/v1alpha1
208+ kind: KnativeServing
209+ metadata:
210+ name: knative-serving
211+ spec:
212+ config:
213+ autoscaler:
214+ scale-down-delay: "15m"
215+ ` ` `
216+
217+ {{< /tab >}}
218+ {{< /tabs >}}
219+
161220---
You can’t perform that action at this time.
0 commit comments