You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
You can place nodes into maintenance mode using the `oc adm` utility, or using `NodeMaintenance` custom resources (CRs).
9
-
10
8
The Node Maintenance Operator watches for new or deleted `NodeMaintenance` CRs. When a new `NodeMaintenance` CR is detected, no new workloads are scheduled and the node is cordoned off from the rest of the cluster. All pods that can be evicted are evicted from the node. When a `NodeMaintenance` CR is deleted, the node that is referenced in the CR is made available for new workloads.
Copy file name to clipboardExpand all lines: modules/eco-node-maintenance-operator-installation-web-console.adoc
+2-1Lines changed: 2 additions & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -29,4 +29,5 @@ To confirm that the installation is successful:
29
29
If the Operator is not installed successfully:
30
30
31
31
. Navigate to the *Operators*->*Installed Operators* page and inspect the `Status` column for any errors or failures.
32
-
. Navigate to the *Workloads*->*Pods* page and check the logs in any pods in the `openshift-operators` project that are reporting issues.
32
+
. Navigate to the *Operators*->*Installed Operators*->*Node Maintenance Operator*->*Details* page, and inspect the `Conditions` section for errors before pod creation.
33
+
. Navigate to the *Workloads*->*Pods* page, search for the `Node Maintenance Operator` pod in the installed namespace, and check the logs in the `Logs` tab.
= Resuming a bare-metal node from maintenance mode
8
+
Resume a bare-metal node from maintenance mode using the Options menu {kebab} found on each node in the *Compute*->*Nodes* list, or using the *Actions* control of the *Node Details* screen.
9
+
10
+
.Procedure
11
+
12
+
. From the *Administrator* perspective of the web console, click *Compute*->*Nodes*.
13
+
. You can resume the node from this screen, which makes it easier to perform actions on multiple nodes, or from the *Node Details* screen, where you can view comprehensive details of the selected node:
14
+
** Click the Options menu {kebab} at the end of the node and select
15
+
*Stop Maintenance*.
16
+
** Click the node name to open the *Node Details* screen and click
17
+
*Actions*->*Stop Maintenance*.
18
+
. Click *Stop Maintenance* in the confirmation window.
19
+
20
+
The node becomes schedulable. If it had virtual machine instances that were running on the node prior to maintenance, then they will not automatically migrate back to this node.
21
+
22
+
.Verification
23
+
24
+
* Navigate to the *Compute*->*Nodes* page and verify that the corresponding node has a status of `Ready`.
Set a bare-metal node to maintenance mode using the Options menu {kebab} found on each node in the *Compute*->*Nodes* list, or using the *Actions* control of the *Node Details* screen.
9
+
10
+
.Procedure
11
+
12
+
. From the *Administrator* perspective of the web console, click *Compute*->*Nodes*.
13
+
. You can set the node to maintenance from this screen, which makes it easier to perform actions on multiple nodes, or from the *Node Details* screen, where you can view comprehensive details of the selected node:
14
+
** Click the Options menu {kebab} at the end of the node and select *Start Maintenance*.
15
+
** Click the node name to open the *Node Details* screen and click
16
+
*Actions*->*Start Maintenance*.
17
+
. Click *Start Maintenance* in the confirmation window.
18
+
19
+
The node is no longer schedulable. If it had virtual machines with the `LiveMigration` eviction strategy, then it will live migrate them. All other pods and virtual machines on the node are deleted and recreated on another node.
20
+
21
+
.Verification
22
+
23
+
* Navigate to the *Compute*->*Nodes* page and verify that the corresponding node has a status of `Under maintenance`.
Copy file name to clipboardExpand all lines: modules/eco-setting-node-maintenance-cr-cli.adoc
+7-3Lines changed: 7 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -38,14 +38,18 @@ spec:
38
38
$ oc apply -f nodemaintenance-cr.yaml
39
39
----
40
40
41
-
. Check the progress of the maintenance task by running the following command, replacing `<node-name>` with the name of your node; for example, `node-1.example.com`:
41
+
.Verification
42
+
43
+
. Check the progress of the maintenance task by running the following command:
42
44
+
43
45
[source,terminal]
44
46
----
45
-
$ oc describe node node-1.example.com
47
+
$ oc describe node <node-name>
46
48
----
47
49
+
48
-
.Example output
50
+
where `<node-name>` is the name of your node; for example, `node-1.example.com`
You can use the Node Maintenance Operator to place nodes in maintenance mode. This is a standalone version of the Node Maintenance Operator that is independent of {VirtProductName} installation.
10
-
11
-
[NOTE]
12
-
====
13
-
If you have installed {VirtProductName}, you must use the Node Maintenance Operator that is bundled with it.
14
-
====
9
+
You can use the Node Maintenance Operator to place nodes in maintenance mode by using the `oc adm` utility or `NodeMaintenance` custom resources (CRs).
You can resume a node from maintenance mode from the CLI or by using a `NodeMaintenance` CR. Resuming a node brings it out of maintenance mode and makes it schedulable again.
40
+
You can resume a node from maintenance mode from the web console or from the CLI by using a `NodeMaintenance` CR. Resuming a node brings it out of maintenance mode and makes it schedulable again.
For clusters with bare-metal nodes, you can place a node into maintenance mode, and resume a node from maintenance mode, by using the web console *Actions* control.
49
+
50
+
[NOTE]
51
+
====
52
+
Clusters with bare-metal nodes can also place a node into maintenance mode, and resume a node from maintenance mode, by using the web console and CLI, as outlined. These methods, by using the web console *Actions* control, are applicable to bare-metal clusters only.
== Gathering data about the Node Maintenance Operator
51
63
To collect debugging information about the Node Maintenance Operator, use the `must-gather` tool. For information about the `must-gather` image for the Node Maintenance Operator, see xref:../../support/gathering-cluster-data.adoc#gathering-data-specific-features_gathering-cluster-data[Gathering data about specific features].
@@ -55,4 +67,4 @@ To collect debugging information about the Node Maintenance Operator, use the `m
55
67
== Additional resources
56
68
* xref:../../support/gathering-cluster-data.adoc#gathering-cluster-data[Gathering data about your cluster]
57
69
* xref:../../nodes/nodes/nodes-nodes-working.adoc#nodes-nodes-working-evacuating_nodes-nodes-working[Understanding how to evacuate pods on nodes]
58
-
* xref:../../nodes/nodes/nodes-nodes-working.adoc#nodes-nodes-working-marking_nodes-nodes-working[Understanding how to mark nodes as unschedulable or schedulable]
70
+
* xref:../../nodes/nodes/nodes-nodes-working.adoc#nodes-nodes-working-marking_nodes-nodes-working[Understanding how to mark nodes as unschedulable or schedulable]
0 commit comments