You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: pages/serverless-containers/reference-content/containers-autoscaling.mdx
+35-5Lines changed: 35 additions & 5 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -18,19 +18,27 @@ categories:
18
18
19
19
[Autoscaling](/serverless-containers/concepts/#autoscaling) offers several benefits, including improved responsiveness and cost efficiency. By automatically adjusting the number of instances of your resource based on current demand, you ensure that your applications can handle varying loads without manual intervention. This not only enhances user experience by maintaining performance levels but also helps in reducing costs by only using resources when necessary. Additionally, autoscaling helps in optimal resource utilization, minimizing the risk of performance degradation during peak times.
20
20
21
-
## Minimum and maximum scales
21
+
Autoscaling can be based on exactly one of the following parameters:
22
22
23
-
### Minimum scale (min-scale)
23
+
- Request concurrency
24
+
- CPU percentage
25
+
- RAM percentage
24
26
25
-
This parameter sets the lowest value your resource is allowed to scale down to:
27
+
## Request concurrency
28
+
29
+
### Minimum and maximum scales
30
+
31
+
#### Minimum scale (min-scale)
32
+
33
+
This parameter sets the minimum number of instances your resource is allowed to scale down to:
26
34
27
35
- If you set a value of `0`, all instances of your resource will be terminated after 15 minutes of inactivity.
28
36
29
37
- If you set a value of `1` or more, the corresponding number of instances of your resource will remain available at all time.
30
38
31
39
Customizing the minimum scale for Serverless can help ensure that an instance remains pre-allocated and ready to handle requests, reducing delays associated with [cold starts](/serverless-containers/concepts/#cold-start). However, this setting also impacts the costs of your Serverless Container.
32
40
33
-
### Maximum scale (max-scale)
41
+
####Maximum scale (max-scale)
34
42
35
43
This parameter sets the maximum number of instances of your resource. You should adjust it based on your resource's traffic spikes, keeping in mind that you may wish to limit the max scale to manage costs effectively.
36
44
@@ -51,4 +59,26 @@ The same autoscaler decides to remove instances when:
51
59
52
60
<Messagetype="note">
53
61
Redeploying your resource results in the termination of existing instances and a return to the min scale, which you observe when redeploying.
54
-
</Message>
62
+
</Message>
63
+
64
+
## CPU and RAM percentage
65
+
66
+
### Minimum and maximum scales
67
+
68
+
#### Minimum scale (min-scale)
69
+
70
+
This parameter sets the minimum number of instances your resource is allowed to scale down to.
71
+
72
+
<Messagetype="note">
73
+
For technical reasons, the minimum scale for CPU/RAM-based autoscaling is `1`.
74
+
</Message>
75
+
76
+
#### Maximum scale (max-scale)
77
+
78
+
This parameter sets the maximum number of instances of your resource. You should adjust it based on your resource's CPU or RAM usage spikes, keeping in mind that you may wish to limit the max scale to manage costs effectively.
79
+
80
+
### Autoscaler behavior
81
+
82
+
The autoscaler decides to start new instances when the existing instances' CPU or RAM usage exceeds the threshold you defined for a certain amount of time.
83
+
84
+
The same autoscaler decides to remove existing instances when the CPU or RAM usage of certain instances is reduced, and the remaining instances' usage does not exceed the threshold.
0 commit comments