You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
You can automatically scale a workload vertically using a _VerticalPodAutoscaler_ (VPA).
61
58
Different to the HPA, the VPA doesn't come with Kubernetes by default, but is a separate project
62
59
that can be found [on GitHub](https://github.com/kubernetes/autoscaler/tree/9f87b78df0f1d6e142234bb32e8acbd71295585a/vertical-pod-autoscaler).
@@ -65,12 +62,11 @@ Once installed, it allows you to create {{< glossary_tooltip text="CustomResourc
65
62
(CRDs) for your workloads which define _how_ and _when_ to scale the resources of the managed replicas.
66
63
67
64
{{< note >}}
68
-
The current default version of the HPA (**v0.14.0**) requires **Kubernetes version 1.25** or later.
69
-
You will also need to have the [Metrics Server](https://github.com/kubernetes-sigs/metrics-server)
70
-
installed to your cluster.
65
+
You will need to have the [Metrics Server](https://github.com/kubernetes-sigs/metrics-server)
66
+
installed to your cluster for the HPA to work.
71
67
{{< /note >}}
72
68
73
-
At the moment, the VPA operates can operate in four different modes:
69
+
At the moment, the VPA can operate in four different modes:
74
70
75
71
{{< table caption="Different modes of the VPA" >}}
76
72
Mode | Description
@@ -81,14 +77,25 @@ Mode | Description
81
77
`Off` | The VPA does not automatically change the resource requirements of the pods. The recommendations are calculated and can be inspected in the VPA object.
For workloads that need to be scaled based on the size of the cluster (for example
87
93
`cluster-dns` or other system components), you can use the _Cluster Proportional Autoscaler_.<br />
88
94
Just like the VPA, it is not part of the Kubernetes core, but hosted in its
89
95
own repository [on GitHub](https://github.com/kubernetes-sigs/cluster-proportional-autoscaler).
90
96
91
-
The Cluster Proportional Autoscaler watches the number of schedulable {{< glossary_tooltip text="nodes" term_id="node" >}} and cores and scales the number of replicas of the target workload accordingly.
97
+
The Cluster Proportional Autoscaler watches the number of schedulable {{< glossary_tooltip text="nodes" term_id="node" >}}
98
+
and cores and scales the number of replicas of the target workload accordingly.
92
99
93
100
### Event driven Autoscaling
94
101
@@ -101,20 +108,30 @@ a wide range of adapters for different event sources to choose from.
101
108
102
109
### Autoscaling based on schedules
103
110
104
-
_tbd_
111
+
Another strategy for scaling your workloads is to **schedule** the scaling operations, for example in order to
112
+
reduce resource consumption during off-peak hours.
113
+
114
+
Similar to event driven autoscaling, such behavior can be achieved using KEDA in conjunction with
115
+
its [`Cron` scaler](https://keda.sh/docs/2.13/scalers/cron/). The `Cron` scaler allows you to define schedules
116
+
(and time zones) for scaling your workloads in or out.
105
117
106
118
## Scaling cluster infrastructure
107
119
108
-
_tbd_, short summary
120
+
If scaling workloads isn't enough to meet your needs, you can also scale your cluster infrastructure itself.
121
+
122
+
Scaling the cluster infrastructure normally means adding or removing {{< glossary_tooltip text="nodes" term_id="node" >}}.
123
+
This can be done using one of two available autoscalers:
0 commit comments