|
3 | 3 | // * machine_management/applying-autoscaling.adoc |
4 | 4 | // * post_installation_configuration/cluster-tasks.adoc |
5 | 5 |
|
| 6 | +:_content-type: REFERENCE |
6 | 7 | [id="cluster-autoscaler-cr_{context}"] |
7 | 8 | = ClusterAutoscaler resource definition |
8 | 9 |
|
@@ -38,26 +39,29 @@ spec: |
38 | 39 | delayAfterDelete: 5m <13> |
39 | 40 | delayAfterFailure: 30s <14> |
40 | 41 | unneededTime: 5m <15> |
| 42 | + utilizationThreshold: 0.4 <16> |
41 | 43 | ---- |
42 | 44 | <1> Specify the priority that a pod must exceed to cause the cluster autoscaler to deploy additional nodes. Enter a 32-bit integer value. The `podPriorityThreshold` value is compared to the value of the `PriorityClass` that you assign to each pod. |
43 | 45 | <2> Specify the maximum number of nodes to deploy. This value is the total number of machines that are deployed in your cluster, not just the ones that the autoscaler controls. Ensure that this value is large enough to account for all of your control plane and compute machines and the total number of replicas that you specify in your `MachineAutoscaler` resources. |
44 | 46 | <3> Specify the minimum number of cores to deploy in the cluster. |
45 | 47 | <4> Specify the maximum number of cores to deploy in the cluster. |
46 | 48 | <5> Specify the minimum amount of memory, in GiB, in the cluster. |
47 | 49 | <6> Specify the maximum amount of memory, in GiB, in the cluster. |
48 | | -<7> Optionally, specify the type of GPU node to deploy. Only `nvidia.com/gpu` and `amd.com/gpu` are valid types. |
| 50 | +<7> Optional: Specify the type of GPU node to deploy. Only `nvidia.com/gpu` and `amd.com/gpu` are valid types. |
49 | 51 | <8> Specify the minimum number of GPUs to deploy in the cluster. |
50 | 52 | <9> Specify the maximum number of GPUs to deploy in the cluster. |
51 | 53 | <10> In this section, you can specify the period to wait for each action by using any valid link:https://golang.org/pkg/time/#ParseDuration[ParseDuration] interval, including `ns`, `us`, `ms`, `s`, `m`, and `h`. |
52 | 54 | <11> Specify whether the cluster autoscaler can remove unnecessary nodes. |
53 | | -<12> Optionally, specify the period to wait before deleting a node after a node has recently been _added_. If you do not specify a value, the default value of `10m` is used. |
54 | | -<13> Specify the period to wait before deleting a node after a node has recently been _deleted_. If you do not specify a value, the default value of `10s` is used. |
55 | | -<14> Specify the period to wait before deleting a node after a scale down failure occurred. If you do not specify a value, the default value of `3m` is used. |
56 | | -<15> Specify the period before an unnecessary node is eligible for deletion. If you do not specify a value, the default value of `10m` is used. |
| 55 | +<12> Optional: Specify the period to wait before deleting a node after a node has recently been _added_. If you do not specify a value, the default value of `10m` is used. |
| 56 | +<13> Optional: Specify the period to wait before deleting a node after a node has recently been _deleted_. If you do not specify a value, the default value of `0s` is used. |
| 57 | +<14> Optional: Specify the period to wait before deleting a node after a scale down failure occurred. If you do not specify a value, the default value of `3m` is used. |
| 58 | +<15> Optional: Specify the period before an unnecessary node is eligible for deletion. If you do not specify a value, the default value of `10m` is used. |
| 59 | +<16> Optional: Specify the _node utilization level_ below which an unnecessary node is eligible for deletion. The node utilization level is the sum of the requested resources divided by the allocated resources for the node, and must be a value greater than `0` but less than `1`. If you do not specify a value, the cluster autoscaler uses a default value of `0.5`, which corresponds to 50% utilization. |
| 60 | +// Might be able to add a formula to show this visually, but need to look into asciidoc math formatting and what our tooling supports. |
57 | 61 |
|
58 | 62 | [NOTE] |
59 | 63 | ==== |
60 | 64 | When performing a scaling operation, the cluster autoscaler remains within the ranges set in the `ClusterAutoscaler` resource definition, such as the minimum and maximum number of cores to deploy or the amount of memory in the cluster. However, the cluster autoscaler does not correct the current values in your cluster to be within those ranges. |
61 | 65 |
|
62 | | -The minimum and maximum CPUs, memory, and GPU values are determined by calculating those resources on all nodes in the cluster, even if the cluster autoscaler does not manage the nodes. For example, the control plane nodes are considered in the total memory in the cluster, even though the cluster autoscaler does not manage the control plane nodes. |
| 66 | +The minimum and maximum CPUs, memory, and GPU values are determined by calculating those resources on all nodes in the cluster, even if the cluster autoscaler does not manage the nodes. For example, the control plane nodes are considered in the total memory in the cluster, even though the cluster autoscaler does not manage the control plane nodes. |
63 | 67 | ==== |
0 commit comments