Skip to content

Commit e6d8217

Browse files
authored
Merge pull request #46359 from pneedle-rh/osdocs-3303-updating-rosa-planning-considerations
OSDOCS-3303 - Updating the ROSA planning considerations
2 parents 5cc11ab + 9ec79e2 commit e6d8217

File tree

6 files changed

+74
-21
lines changed

6 files changed

+74
-21
lines changed

modules/rosa-planning-cluster-maximums-environment.adoc

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -39,7 +39,7 @@ The following table lists the OpenShift Container Platform environment and confi
3939
|3
4040
|us-west-2
4141

42-
|Worker nodes
42+
|Compute nodes
4343
|m5.2xlarge
4444
|8
4545
|32

modules/rosa-planning-cluster-maximums.adoc

Lines changed: 8 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,14 @@
66
[id="tested-cluster-maximums_{context}"]
77
= ROSA tested cluster maximums
88

9-
The following table specifies the maximum limits for each tested type in a {product-title} cluster.
9+
Consider the following tested object maximums when you plan a {product-title} (ROSA) cluster installation. The table specifies the maximum limits for each tested type in a ROSA cluster.
10+
11+
These guidelines are based on a cluster of 102 compute (also known as worker) nodes in a multiple availability zone configuration. For smaller clusters, the maximums are lower.
12+
13+
[NOTE]
14+
====
15+
The OpenShift Container Platform version used in all of the tests is OCP 4.8.0.
16+
====
1017

1118
.Tested cluster maximums
1219
[options="header",cols="50,50"]
Lines changed: 51 additions & 12 deletions
Original file line numberDiff line numberDiff line change
@@ -1,22 +1,25 @@
11

22
// Module included in the following assemblies:
33
//
4-
// rosa_planning/rosa-planning-environment.adoc
4+
// rosa_planning/rosa-limits-scalability.adoc
55

6-
[id="initial-planning-considerations_{context}"]
7-
= Initial planning considerations
6+
[id="control-plane-and-infra-node-sizing-and-scaling_{context}"]
7+
= Control plane and infrastructure node sizing and scaling
88

9-
Consider the following tested object maximums when you plan your {product-title} cluster.
9+
When you install a {product-title} (ROSA) cluster, the sizing of the control plane and infrastructure nodes are automatically determined by the compute node count.
1010

11-
These guidelines are based on a cluster of 102 workers in a multi-availability zone configuration. For smaller clusters, the maximums are lower.
11+
If you change the number of compute nodes in your cluster after installation, the Red Hat Site Reliability Engineering (SRE) team scales the control plane and infrastructure nodes as required to maintain cluster stability.
1212

13-
The sizing of the control plane and infrastructure nodes is dynamically calculated during the installation process, based on the number of worker nodes. If you change the number of worker nodes after the installation, control plane and infra nodes must be resized manually. Infra nodes are resized by the Red Hat SRE team, and you can link:https://access.redhat.com/[open a ticket in the Customer Portal] to request the infra node resizing.
13+
[id="node-sizing-during-installation_{context}"]
14+
== Node sizing during installation
1415

15-
The following table lists the size of control plane and infrastructure nodes that are assigned during installation.
16+
During the installation process, the sizing of the control plane and infrastructure nodes are dynamically calculated. The sizing calculation is based on the number of compute nodes in a cluster.
17+
18+
The following table lists the control plane and infrastructure node sizing that is applied during installation.
1619

1720
[options="header",cols="3*"]
1821
|===
19-
| Number of worker nodes |Control plane size |Infrastructure node size
22+
| Number of compute nodes |Control plane size |Infrastructure node size
2023

2124
|1 to 25
2225
|m5.2xlarge
@@ -32,11 +35,47 @@ The following table lists the size of control plane and infrastructure nodes tha
3235
|===
3336
[.small]
3437
--
35-
1. The maximum number of worker nodes on ROSA is 180
38+
1. The maximum number of compute nodes on ROSA is 180.
3639
--
3740

38-
For larger clusters, infrastructure node sizing can become a large impacting factor to scalability. There are many factors that influence the stated thresholds, including the etcd version or storage data format.
41+
[id="node-scaling-after-installation_{context}"]
42+
== Node scaling after installation
3943

40-
Exceeding these limits does not necessarily mean that the cluster will fail. In most cases, exceeding these numbers results in lower overall performance.
44+
If you change the number of compute nodes after installation, the control plane and infrastructure nodes are scaled by the Red Hat Site Reliability Engineering (SRE) team as required. The nodes are scaled to maintain platform stability.
45+
46+
Post-installation scaling requirements for control plane and infrastructure nodes are assessed on a case-by-case basis. Node resource consumption and received alerts are taken into consideration.
47+
48+
.Rules for control plane node resizing alerts
49+
50+
Resizing alerts are triggered for the control plane nodes in a cluster when either of the following scenarios are true:
51+
52+
* Each control plane node has less than 16GiB RAM, and there are more than 25 and less than 101 compute nodes.
53+
* Each control plane node has less than 32GiB RAM, and there are more than 100 compute nodes.
54+
+
55+
[NOTE]
56+
====
57+
The maximum number of compute nodes on ROSA is 180.
58+
====
59+
60+
.Rules for infrastructure node resizing alerts
4161

42-
The OpenShift Container Platform version used in all of the tests is OCP 4.8.0.
62+
Resizing alerts are triggered for the infrastructure nodes in a cluster when either of the following scenarios are true:
63+
64+
* Each infrastructure node has less than 16GiB RAM or less than 5 CPUs, and there are more than 25 and less than 101 compute nodes.
65+
* Each infrastructure node has less than 32GiB RAM or less than 9 CPUs, and there are more than 100 compute nodes.
66+
+
67+
[NOTE]
68+
====
69+
The maximum number of compute nodes on ROSA is 180.
70+
====
71+
72+
The SRE team might scale the control plane and infrastructure nodes for additional reasons, for example to manage an increase in resource consumption on the nodes.
73+
74+
When scaling is applied, the customer is notified through a service log entry.
75+
76+
[id="sizing-considerations-for-larger-clusters_{context}"]
77+
== Sizing considerations for larger clusters
78+
79+
For larger clusters, infrastructure node sizing can become a significant impacting factor to scalability. There are many factors that influence the stated thresholds, including the etcd version or storage data format.
80+
81+
Exceeding these limits does not necessarily mean that the cluster will fail. In most cases, exceeding these numbers results in lower overall performance.

rosa_install_access_delete_clusters/rosa_getting_started_iam/rosa-aws-prereqs.adoc

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -22,10 +22,10 @@ include::modules/rosa-aws-provisioned.adoc[leveloffset=+1]
2222
include::modules/osd-aws-privatelink-firewall-prerequisites.adoc[leveloffset=+1]
2323

2424
== Next steps
25-
xref:../rosa_getting_started_iam/rosa-required-aws-service-quotas.adoc#rosa-required-aws-service-quotas[Review the required AWS service quotas]
25+
* xref:../rosa_getting_started_iam/rosa-required-aws-service-quotas.adoc#rosa-required-aws-service-quotas[Review the required AWS service quotas]
2626

2727
[role="_additional-resources"]
2828
== Additional resources
29-
* See xref:../../rosa_planning/rosa-limits-scalability.adoc#initial-planning-considerations_rosa-limits-scalability[Intial Planning Considerations] for guidance on worker node count.
30-
* See xref:../../rosa_architecture/rosa_policy_service_definition/rosa-policy-process-security.adoc#rosa-policy-sre-access_rosa-policy-process-security[SRE access to all Red Hat OpenShift Service on AWS clusters] for information about how Red Hat site reliability engineering accesses ROSA clusters.
29+
* xref:../../rosa_planning/rosa-limits-scalability.adoc#rosa-limits-scalability[Limits and scalability]
30+
* xref:../../rosa_architecture/rosa_policy_service_definition/rosa-policy-process-security.adoc#rosa-policy-sre-access_rosa-policy-process-security[SRE access to all Red Hat OpenShift Service on AWS clusters]
3131
* xref:../rosa_getting_started_iam/rosa-getting-started-workflow.adoc#rosa-understanding-the-deployment-workflow[Understanding the ROSA deployment workflow]

rosa_planning/rosa-limits-scalability.adoc

Lines changed: 8 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -7,6 +7,13 @@ include::_attributes/attributes-openshift-dedicated.adoc[]
77

88
toc::[]
99

10-
include::modules/rosa-planning-considerations.adoc[leveloffset=+1]
10+
This document details the tested cluster maximums for {product-title} (ROSA) clusters, along with information about the test environment and configuration used to test the maximums. Information about control plane and infrastructure node sizing and scaling is also provided.
11+
1112
include::modules/rosa-planning-cluster-maximums.adoc[leveloffset=+1]
1213
include::modules/rosa-planning-cluster-maximums-environment.adoc[leveloffset=+1]
14+
include::modules/rosa-planning-considerations.adoc[leveloffset=+1]
15+
16+
[id="next-steps_configuring-alert-notifications"]
17+
== Next steps
18+
19+
* xref:../rosa_planning/rosa-planning-environment.adoc#rosa-planning-environment[Planning your environment]

rosa_planning/rosa-sts-aws-prereqs.adoc

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -28,9 +28,9 @@ include::modules/rosa-aws-provisioned.adoc[leveloffset=+1]
2828
include::modules/osd-aws-privatelink-firewall-prerequisites.adoc[leveloffset=+1]
2929

3030
== Next steps
31-
xref:../rosa_planning/rosa-sts-required-aws-service-quotas.adoc#rosa-sts-required-aws-service-quotas[Review the required AWS service quotas]
31+
* xref:../rosa_planning/rosa-sts-required-aws-service-quotas.adoc#rosa-sts-required-aws-service-quotas[Review the required AWS service quotas]
3232

3333
[role="_additional-resources"]
3434
== Additional resources
35-
* See xref:../rosa_planning/rosa-limits-scalability.adoc#initial-planning-considerations_rosa-limits-scalability[Intial Planning Considerations] for guidance on worker node count.
36-
* See xref:../rosa_architecture/rosa_policy_service_definition/rosa-policy-process-security.adoc#rosa-policy-sre-access_rosa-policy-process-security[SRE access to all Red Hat OpenShift Service on AWS clusters] for information about how Red Hat site reliability engineering accesses ROSA clusters.
35+
* xref:../rosa_planning/rosa-limits-scalability.adoc#rosa-limits-scalability[Limits and scalability]
36+
* xref:../rosa_architecture/rosa_policy_service_definition/rosa-policy-process-security.adoc#rosa-policy-sre-access_rosa-policy-process-security[SRE access to all Red Hat OpenShift Service on AWS clusters]

0 commit comments

Comments
 (0)