Skip to content

Commit 16fa27b

Browse files
authored
Merge pull request #54786 from lahinson/etcd-quorum-guard-osdocs-5351
[OCPBUGS-5351]: adding quorum guard instructions to etcd docs
2 parents 31ff7f9 + 7b95516 commit 16fa27b

File tree

3 files changed

+69
-0
lines changed

3 files changed

+69
-0
lines changed

modules/restore-replace-crashlooping-etcd-member.adoc

Lines changed: 23 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -144,6 +144,15 @@ sh-4.2# etcdctl member list -w table
144144
+
145145
You can now exit the node shell.
146146

147+
. Turn off the quorum guard by entering the following command:
148+
+
149+
[source,terminal]
150+
----
151+
$ oc patch etcd/cluster --type=merge -p '{"spec": {"unsupportedConfigOverrides": {"useUnsupportedUnsafeNonHANonProductionUnstableEtcd": true}}}'
152+
----
153+
+
154+
This command ensures that you can successfully re-create secrets and roll out the static pods.
155+
147156
. Remove the old secrets for the unhealthy etcd member that was removed.
148157

149158
.. List the secrets for the unhealthy etcd member that was removed.
@@ -199,6 +208,20 @@ $ oc patch etcd cluster -p='{"spec": {"forceRedeploymentReason": "single-master-
199208
+
200209
When the etcd cluster Operator performs a redeployment, it ensures that all control plane nodes have a functioning etcd pod.
201210

211+
. Turn the quorum guard back on by entering the following command:
212+
+
213+
[source,terminal]
214+
----
215+
$ oc patch etcd/cluster --type=merge -p '\{"spec": {"unsupportedConfigOverrides": null}}
216+
----
217+
218+
. You can verify that the `unsupportedConfigOverrides` section is removed from the object by entering this command:
219+
+
220+
[source,terminal]
221+
----
222+
$ oc get etcd/cluster -oyaml
223+
----
224+
202225
.Verification
203226
204227
* Verify that the new member is available and healthy.

modules/restore-replace-stopped-baremetal-etcd-member.adoc

Lines changed: 23 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -115,6 +115,15 @@ You can now exit the node shell.
115115
After you remove the member, the cluster might be unreachable for a short time while the remaining etcd instances reboot.
116116
====
117117

118+
. Turn off the quorum guard by entering the following command:
119+
+
120+
[source,terminal]
121+
----
122+
$ oc patch etcd/cluster --type=merge -p '{"spec": {"unsupportedConfigOverrides": {"useUnsupportedUnsafeNonHANonProductionUnstableEtcd": true}}}'
123+
----
124+
+
125+
This command ensures that you can successfully re-create secrets and roll out the static pods.
126+
118127
. Remove the old secrets for the unhealthy etcd member that was removed by running the following commands.
119128

120129
.. List the secrets for the unhealthy etcd member that was removed.
@@ -531,6 +540,20 @@ openshift-compute-0 Ready worker 3h58m v1.25.0
531540
openshift-compute-1 Ready worker 3h58m v1.25.0
532541
----
533542

543+
. Turn the quorum guard back on by entering the following command:
544+
+
545+
[source,terminal]
546+
----
547+
$ oc patch etcd/cluster --type=merge -p '\{"spec": {"unsupportedConfigOverrides": null}}
548+
----
549+
550+
. You can verify that the `unsupportedConfigOverrides` section is removed from the object by entering this command:
551+
+
552+
[source,terminal]
553+
----
554+
$ oc get etcd/cluster -oyaml
555+
----
556+
534557
.Verification
535558
536559
. Verify that all etcd pods are running properly.

modules/restore-replace-stopped-etcd-member.adoc

Lines changed: 23 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -114,6 +114,15 @@ You can now exit the node shell.
114114
After you remove the member, the cluster might be unreachable for a short time while the remaining etcd instances reboot.
115115
====
116116

117+
. Turn off the quorum guard by entering the following command:
118+
+
119+
[source,terminal]
120+
----
121+
$ oc patch etcd/cluster --type=merge -p '{"spec": {"unsupportedConfigOverrides": {"useUnsupportedUnsafeNonHANonProductionUnstableEtcd": true}}}'
122+
----
123+
+
124+
This command ensures that you can successfully re-create secrets and roll out the static pods.
125+
117126
. Remove the old secrets for the unhealthy etcd member that was removed.
118127

119128
.. List the secrets for the unhealthy etcd member that was removed.
@@ -307,6 +316,20 @@ clustername-8qw5l-worker-us-east-1c-pkg26 Running m4.large us-east-1
307316
+
308317
It might take a few minutes for the new machine to be created. The etcd cluster Operator will automatically sync when the machine or node returns to a healthy state.
309318

319+
. Turn the quorum guard back on by entering the following command:
320+
+
321+
[source,terminal]
322+
----
323+
$ oc patch etcd/cluster --type=merge -p '\{"spec": {"unsupportedConfigOverrides": null}}
324+
----
325+
326+
. You can verify that the `unsupportedConfigOverrides` section is removed from the object by entering this command:
327+
+
328+
[source,terminal]
329+
----
330+
$ oc get etcd/cluster -oyaml
331+
----
332+
310333
.Verification
311334
312335
. Verify that all etcd pods are running properly.

0 commit comments

Comments
 (0)