Skip to content

Commit 0a359cf

Browse files
authored
Merge pull request #60789 from dfitzmau/OSDOCS-5020
2 parents 2e0868e + e4a11a2 commit 0a359cf

File tree

1 file changed

+24
-4
lines changed

1 file changed

+24
-4
lines changed

modules/dr-restoring-cluster-state.adoc

Lines changed: 24 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -12,7 +12,7 @@ You can use a saved etcd backup to restore a previous cluster state or restore a
1212

1313
[NOTE]
1414
====
15-
If your cluster uses a control plane machine set, see "Troubleshooting the control plane machine set" for a more simple etcd recovery procedure.
15+
If your cluster uses a control plane machine set, see "Troubleshooting the control plane machine set" for a more simple etcd recovery procedure.
1616
====
1717

1818
[IMPORTANT]
@@ -290,7 +290,8 @@ etcd-ip-10-0-143-125.ec2.internal 1/1 Running 1
290290
----
291291
+
292292
If the status is `Pending`, or the output lists more than one running etcd pod, wait a few minutes and check again.
293-
293+
+
294+
.. Optional: For each lost control plane host, repeat the steps for verifying that the single member control plane has started successfully.
294295
+
295296
[NOTE]
296297
====
@@ -304,6 +305,8 @@ Perform the following step only if you are using `OVNKubernetes` network plugin.
304305
$ oc delete node <non-recovery-controlplane-host-1> <non-recovery-controlplane-host-2>
305306
----
306307

308+
. Optional: For any remaining non-recovery control plane nodes, delete and recreate each non-recovery control plane node.
309+
307310
. Verify that the Cluster Network Operator (CNO) redeploys the OVN-Kubernetes control plane and that it no longer references the wrong controller IP addresses. To verify this result, regularly check the output of the following command. Wait until it returns an empty result before you proceed with the next step.
308311
+
309312
[source,terminal]
@@ -379,7 +382,7 @@ $ oc get pods -n openshift-ovn-kubernetes | grep ovnkube-node
379382

380383
. Delete and re-create other non-recovery, control plane machines, one by one. After the machines are re-created, a new revision is forced and etcd automatically scales up.
381384
+
382-
** If you use a user-provisioned bare metal installation, you can re-create a control plane machine by using the same method that you used to originally create it. For more information, see "Installing a user-provisioned cluster on bare metal".
385+
** If you use a user-provisioned bare metal installation, you can re-create a control plane machine by using the same method that you used to originally create it. For more information, see "Installing a user-provisioned cluster on bare metal".
383386
+
384387
[WARNING]
385388
====
@@ -712,4 +715,21 @@ etcd-ip-10-0-173-171.ec2.internal 2/2 Running 0
712715

713716
To ensure that all workloads return to normal operation following a recovery procedure, restart each pod that stores Kubernetes API information. This includes {product-title} components such as routers, Operators, and third-party components.
714717

715-
Note that it might take several minutes after completing this procedure for all services to be restored. For example, authentication by using `oc login` might not immediately work until the OAuth server pods are restarted.
718+
[NOTE]
719+
====
720+
On completion of the previous procedural steps, you might need to wait a few minutes for all services to return to their restored state. For example, authentication by using `oc login` might not immediately work until the OAuth server pods are restarted.
721+
722+
Consider using the `system:admin` `kubeconfig` file for immediate authentication. This method basis its authentication on SSL/TLS client certificates as against OAuth tokens. You can authenticate with this file by issuing the following command:
723+
724+
[source,terminal]
725+
----
726+
$ export KUBECONFIG=<installation_directory>/auth/kubeconfig
727+
----
728+
729+
Issue the following command to display your authenticated user name:
730+
731+
[source,terminal]
732+
----
733+
$ oc whoami
734+
----
735+
====

0 commit comments

Comments
 (0)