Skip to content

Commit 0fa98c7

Browse files
authored
Merge pull request #42794 from abhatt-rh/telcodocs-451
ECOPORJECT-201: Documentation for the Poison Pill Operator for 4.10
2 parents a036a7f + 2da73a5 commit 0fa98c7

File tree

1 file changed

+32
-2
lines changed

1 file changed

+32
-2
lines changed

modules/eco-poison-pill-operator-about.adoc

Lines changed: 32 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -40,12 +40,42 @@ spec:
4040
----
4141

4242
<1> Specify the timeout duration for the surviving peer, after which the Operator can assume that an unhealthy node has been rebooted. The Operator automatically calculates the lower limit for this value. However, if different nodes have different watchdog timeouts, you must change this value to a higher value.
43-
<2> Specify the file path of the watchdog device in the nodes. If a watchdog device is unavailable, the `PoisonPillConfig` CR uses a software reboot.
43+
<2> Specify the file path of the watchdog device in the nodes. If you enter an incorrect path to the watchdog device, the Poison Pill Operator automatically detects the softdog device path.
44+
+
45+
If a watchdog device is unavailable, the `PoisonPillConfig` CR uses a software reboot.
4446
<3> Specify if you want to enable software reboot of the unhealthy nodes. By default, the value of `isSoftwareRebootEnabled` is set to `true`. To disable the software reboot, set the parameter value to `false`.
4547
<4> Specify the timeout duration to check connectivity with each API server. When this duration elapses, the Operator starts remediation.
4648
<5> Specify the frequency to check connectivity with each API server.
4749
<6> Specify a threshold value. After reaching this threshold, the node starts contacting its peers.
48-
<7> Specify the timeout duration to connect with the peer API server.
50+
<7> Specify the timeout duration for the peer to connect the API server.
4951
<8> Specify the timeout duration for establishing connection with the peer.
5052
<9> Specify the timeout duration to get a response from the peer.
5153
<10> Specify the frequency to update peer information, such as IP address.
54+
55+
[id="understanding-poison-pill-remediation-template-config_{context}"]
56+
== Understanding the Poison Pill Remediation Template configuration
57+
58+
The Poison Pill Operator also creates the `PoisonPillRemediationTemplate` CR with the name `poison-pill-default-template` in the Poison Pill Operator's namespace. This CR defines the remediation strategy for the nodes.
59+
60+
The default remediation strategy is `NodeDeletion` that removes the `node` object.
61+
In {product-title} 4.10, the Poison Pill Operator introduces a new remediation strategy called `ResourceDeletion`. The `ResourceDeletion` remediation strategy removes the pods and associated volume attachments on the node rather than the `node` object. This strategy helps to recover workloads faster.
62+
63+
The `PoisonPillRemediationTemplate` CR resembles the following YAML file:
64+
65+
[source,yaml]
66+
----
67+
apiVersion: poison-pill.medik8s.io/v1alpha1
68+
kind: PoisonPillRemediationTemplate
69+
metadata:
70+
creationTimestamp: "2022-03-02T08:02:40Z"
71+
generation: 1
72+
name: poison-pill-default-template
73+
namespace: openshift-operators
74+
resourceVersion: "596469"
75+
uid: 5d29e437-c485-48fa-ba9e-0354649afd31
76+
spec:
77+
template:
78+
spec:
79+
remediationStrategy: NodeDeletion <1>
80+
----
81+
<1> Specifies the remediation strategy. The default remediation strategy is `NodeDeletion`.

0 commit comments

Comments
 (0)