You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: modules/rosa-planning-cluster-maximums.adoc
+8-1Lines changed: 8 additions & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -6,7 +6,14 @@
6
6
[id="tested-cluster-maximums_{context}"]
7
7
= ROSA tested cluster maximums
8
8
9
-
The following table specifies the maximum limits for each tested type in a {product-title} cluster.
9
+
Consider the following tested object maximums when you plan a {product-title} (ROSA) cluster installation. The table specifies the maximum limits for each tested type in a ROSA cluster.
10
+
11
+
These guidelines are based on a cluster of 102 compute (also known as worker) nodes in a multiple availability zone configuration. For smaller clusters, the maximums are lower.
12
+
13
+
[NOTE]
14
+
====
15
+
The OpenShift Container Platform version used in all of the tests is OCP 4.8.0.
= Control plane and infrastructure node sizing and scaling
8
8
9
-
Consider the following tested object maximums when you plan your {product-title} cluster.
9
+
When you install a {product-title} (ROSA) cluster, the sizing of the control plane and infrastructure nodes are automatically determined by the compute node count.
10
10
11
-
These guidelines are based on a cluster of 102 workers in a multi-availability zone configuration. For smaller clusters, the maximums are lower.
11
+
If you change the number of compute nodes in your cluster after installation, the Red Hat Site Reliability Engineering (SRE) team scales the control plane and infrastructure nodes as required to maintain cluster stability.
12
12
13
-
The sizing of the control plane and infrastructure nodes is dynamically calculated during the installation process, based on the number of worker nodes. If you change the number of worker nodes after the installation, control plane and infra nodes must be resized manually. Infra nodes are resized by the Red Hat SRE team, and you can link:https://access.redhat.com/[open a ticket in the Customer Portal] to request the infra node resizing.
13
+
[id="node-sizing-during-installation_{context}"]
14
+
== Node sizing during installation
14
15
15
-
The following table lists the size of control plane and infrastructure nodes that are assigned during installation.
16
+
During the installation process, the sizing of the control plane and infrastructure nodes are dynamically calculated. The sizing calculation is based on the number of compute nodes in a cluster.
17
+
18
+
The following table lists the control plane and infrastructure node sizing that is applied during installation.
16
19
17
20
[options="header",cols="3*"]
18
21
|===
19
-
| Number of worker nodes |Control plane size |Infrastructure node size
22
+
| Number of compute nodes |Control plane size |Infrastructure node size
20
23
21
24
|1 to 25
22
25
|m5.2xlarge
@@ -32,11 +35,47 @@ The following table lists the size of control plane and infrastructure nodes tha
32
35
|===
33
36
[.small]
34
37
--
35
-
1. The maximum number of worker nodes on ROSA is 180
38
+
1. The maximum number of compute nodes on ROSA is 180.
36
39
--
37
40
38
-
For larger clusters, infrastructure node sizing can become a large impacting factor to scalability. There are many factors that influence the stated thresholds, including the etcd version or storage data format.
41
+
[id="node-scaling-after-installation_{context}"]
42
+
== Node scaling after installation
39
43
40
-
Exceeding these limits does not necessarily mean that the cluster will fail. In most cases, exceeding these numbers results in lower overall performance.
44
+
If you change the number of compute nodes after installation, the control plane and infrastructure nodes are scaled by the Red Hat Site Reliability Engineering (SRE) team as required. The nodes are scaled to maintain platform stability.
45
+
46
+
Post-installation scaling requirements for control plane and infrastructure nodes are assessed on a case-by-case basis. Node resource consumption and received alerts are taken into consideration.
47
+
48
+
.Rules for control plane node resizing alerts
49
+
50
+
Resizing alerts are triggered for the control plane nodes in a cluster when either of the following scenarios are true:
51
+
52
+
* Each control plane node has less than 16GiB RAM, and there are more than 25 and less than 101 compute nodes.
53
+
* Each control plane node has less than 32GiB RAM, and there are more than 100 compute nodes.
54
+
+
55
+
[NOTE]
56
+
====
57
+
The maximum number of compute nodes on ROSA is 180.
58
+
====
59
+
60
+
.Rules for infrastructure node resizing alerts
41
61
42
-
The OpenShift Container Platform version used in all of the tests is OCP 4.8.0.
62
+
Resizing alerts are triggered for the infrastructure nodes in a cluster when either of the following scenarios are true:
63
+
64
+
* Each infrastructure node has less than 16GiB RAM or less than 5 CPUs, and there are more than 25 and less than 101 compute nodes.
65
+
* Each infrastructure node has less than 32GiB RAM or less than 9 CPUs, and there are more than 100 compute nodes.
66
+
+
67
+
[NOTE]
68
+
====
69
+
The maximum number of compute nodes on ROSA is 180.
70
+
====
71
+
72
+
The SRE team might scale the control plane and infrastructure nodes for additional reasons, for example to manage an increase in resource consumption on the nodes.
73
+
74
+
When scaling is applied, the customer is notified through a service log entry.
For larger clusters, infrastructure node sizing can become a significant impacting factor to scalability. There are many factors that influence the stated thresholds, including the etcd version or storage data format.
80
+
81
+
Exceeding these limits does not necessarily mean that the cluster will fail. In most cases, exceeding these numbers results in lower overall performance.
xref:../rosa_getting_started_iam/rosa-required-aws-service-quotas.adoc#rosa-required-aws-service-quotas[Review the required AWS service quotas]
25
+
* xref:../rosa_getting_started_iam/rosa-required-aws-service-quotas.adoc#rosa-required-aws-service-quotas[Review the required AWS service quotas]
26
26
27
27
[role="_additional-resources"]
28
28
== Additional resources
29
-
* See xref:../../rosa_planning/rosa-limits-scalability.adoc#initial-planning-considerations_rosa-limits-scalability[Intial Planning Considerations] for guidance on worker node count.
30
-
* See xref:../../rosa_architecture/rosa_policy_service_definition/rosa-policy-process-security.adoc#rosa-policy-sre-access_rosa-policy-process-security[SRE access to all Red Hat OpenShift Service on AWS clusters] for information about how Red Hat site reliability engineering accesses ROSA clusters.
29
+
* xref:../../rosa_planning/rosa-limits-scalability.adoc#rosa-limits-scalability[Limits and scalability]
30
+
* xref:../../rosa_architecture/rosa_policy_service_definition/rosa-policy-process-security.adoc#rosa-policy-sre-access_rosa-policy-process-security[SRE access to all Red Hat OpenShift Service on AWS clusters]
31
31
* xref:../rosa_getting_started_iam/rosa-getting-started-workflow.adoc#rosa-understanding-the-deployment-workflow[Understanding the ROSA deployment workflow]
This document details the tested cluster maximums for {product-title} (ROSA) clusters, along with information about the test environment and configuration used to test the maximums. Information about control plane and infrastructure node sizing and scaling is also provided.
xref:../rosa_planning/rosa-sts-required-aws-service-quotas.adoc#rosa-sts-required-aws-service-quotas[Review the required AWS service quotas]
31
+
* xref:../rosa_planning/rosa-sts-required-aws-service-quotas.adoc#rosa-sts-required-aws-service-quotas[Review the required AWS service quotas]
32
32
33
33
[role="_additional-resources"]
34
34
== Additional resources
35
-
* See xref:../rosa_planning/rosa-limits-scalability.adoc#initial-planning-considerations_rosa-limits-scalability[Intial Planning Considerations] for guidance on worker node count.
36
-
* See xref:../rosa_architecture/rosa_policy_service_definition/rosa-policy-process-security.adoc#rosa-policy-sre-access_rosa-policy-process-security[SRE access to all Red Hat OpenShift Service on AWS clusters] for information about how Red Hat site reliability engineering accesses ROSA clusters.
35
+
* xref:../rosa_planning/rosa-limits-scalability.adoc#rosa-limits-scalability[Limits and scalability]
36
+
* xref:../rosa_architecture/rosa_policy_service_definition/rosa-policy-process-security.adoc#rosa-policy-sre-access_rosa-policy-process-security[SRE access to all Red Hat OpenShift Service on AWS clusters]
0 commit comments