Commit 8db736a

Document Scale Down Delay (#2857)
* Document Scale Down Delay
* capital change
1 parent 1ed08d3 commit 8db736a

1 file changed: +59 -0 lines changed

docs/serving/autoscaling/scale-bounds.md

Lines changed: 59 additions & 0 deletions
@@ -158,4 +158,63 @@ spec:
{{< /tab >}}
{{< /tabs >}}

## Scale Down Delay

Scale Down Delay specifies a time window that must pass at reduced concurrency
before a scale-down decision is applied. This is useful, for example, for
keeping containers around for a configurable duration, so that requests arriving
shortly after a drop in traffic do not incur a cold-start penalty. Unlike a
lower bound, the delay does not keep the revision scaled up indefinitely: if
concurrency stays reduced for the whole delay period, the revision is scaled down.

* **Global key:** `scale-down-delay`
* **Per-revision annotation key:** `autoscaling.knative.dev/scaleDownDelay`
* **Possible values:** Duration, `0s` <= value <= `1h`
* **Default:** `0s` (no delay)
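
The effect of the delay can be modeled as a sliding-window maximum over the autoscaler's desired replica counts: scale-ups apply immediately, while a scale-down applies only once the lower value has persisted for the whole window. The sketch below is a simplified illustration under that assumption, not Knative's implementation; the function name `apply_scale_down_delay` and the tick-based time model are hypothetical.

```python
from collections import deque


def apply_scale_down_delay(desired, delay_ticks):
    """Return the replica count applied at each tick.

    desired: raw desired replica counts, one per tick (hypothetical input).
    delay_ticks: the scale-down delay expressed in ticks; 0 means no delay.
    """
    window = deque()  # the current desired value plus the previous delay_ticks values
    applied = []
    for want in desired:
        window.append(want)
        if len(window) > delay_ticks + 1:
            window.popleft()
        # Scaling up is immediate; scaling down waits until every value
        # in the window has dropped.
        applied.append(max(window))
    return applied


if __name__ == "__main__":
    # Traffic drops after the first tick; the scale-down lands 3 ticks later.
    print(apply_scale_down_delay([5, 1, 1, 1, 1, 1], delay_ticks=3))
```

With `delay_ticks=0` the applied values equal the desired values, which matches the documented default of `0s` (no delay).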

**Example:**
{{< tabs name="scale-down-delay" default="Per Revision" >}}
{{% tab name="Per Revision" %}}
```yaml
apiVersion: serving.knative.dev/v1
kind: Service
metadata:
  name: helloworld-go
  namespace: default
spec:
  template:
    metadata:
      annotations:
        autoscaling.knative.dev/scaleDownDelay: "15m"
    spec:
      containers:
        - image: gcr.io/knative-samples/helloworld-go
```
{{< /tab >}}
{{% tab name="Global (ConfigMap)" %}}
```yaml
apiVersion: v1
kind: ConfigMap
metadata:
  name: config-autoscaler
  namespace: knative-serving
data:
  scale-down-delay: "15m"
```
{{< /tab >}}
{{% tab name="Global (Operator)" %}}
```yaml
apiVersion: operator.knative.dev/v1alpha1
kind: KnativeServing
metadata:
  name: knative-serving
spec:
  config:
    autoscaler:
      scale-down-delay: "15m"
```

{{< /tab >}}
{{< /tabs >}}
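
Because the documented range is `0s` <= value <= `1h`, it can help to check a value such as `"15m"` before applying it. The following is a minimal sketch that assumes values are written as whole-number hour, minute, and second components (for example `15m` or `1h`); it handles only that subset of duration syntax, and the helper names are hypothetical rather than part of Knative.

```python
import re
from datetime import timedelta

_UNIT_SECONDS = {"h": 3600, "m": 60, "s": 1}
_COMPONENT = re.compile(r"(\d+)([hms])")


def parse_simple_duration(text):
    """Parse strings such as "15m", "0s" or "1h30m" into a timedelta.

    Only whole-number h/m/s components are handled; anything else
    raises ValueError.
    """
    if not text or not re.fullmatch(r"(?:\d+[hms])+", text):
        raise ValueError(f"unsupported duration: {text!r}")
    total = sum(int(n) * _UNIT_SECONDS[u] for n, u in _COMPONENT.findall(text))
    return timedelta(seconds=total)


def is_valid_scale_down_delay(text):
    """Check the documented bounds: 0s <= value <= 1h."""
    try:
        delay = parse_simple_duration(text)
    except ValueError:
        return False
    return timedelta(0) <= delay <= timedelta(hours=1)


if __name__ == "__main__":
    for candidate in ("0s", "15m", "1h", "61m", "2h"):
        print(candidate, is_valid_scale_down_delay(candidate))
```

A check like this only guards against out-of-range values in the simple notation used above; the autoscaler itself remains the authority on which values it accepts.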

---
