You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: modules/eco-poison-pill-operator-about.adoc
+32-2Lines changed: 32 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -40,12 +40,42 @@ spec:
40
40
----
41
41
42
42
<1> Specify the timeout duration for the surviving peer, after which the Operator can assume that an unhealthy node has been rebooted. The Operator automatically calculates the lower limit for this value. However, if different nodes have different watchdog timeouts, you must change this value to a higher value.
43
-
<2> Specify the file path of the watchdog device in the nodes. If a watchdog device is unavailable, the `PoisonPillConfig` CR uses a software reboot.
43
+
<2> Specify the file path of the watchdog device in the nodes. If you enter an incorrect path to the watchdog device, the Poison Pill Operator automatically detects the softdog device path.
44
+
+
45
+
If a watchdog device is unavailable, the `PoisonPillConfig` CR uses a software reboot.
44
46
<3> Specify if you want to enable software reboot of the unhealthy nodes. By default, the value of `isSoftwareRebootEnabled` is set to `true`. To disable the software reboot, set the parameter value to `false`.
45
47
<4> Specify the timeout duration to check connectivity with each API server. When this duration elapses, the Operator starts remediation.
46
48
<5> Specify the frequency to check connectivity with each API server.
47
49
<6> Specify a threshold value. After reaching this threshold, the node starts contacting its peers.
48
-
<7> Specify the timeout duration to connect with the peer API server.
50
+
<7> Specify the timeout duration for the peer to connect the API server.
49
51
<8> Specify the timeout duration for establishing connection with the peer.
50
52
<9> Specify the timeout duration to get a response from the peer.
51
53
<10> Specify the frequency to update peer information, such as IP address.
== Understanding the Poison Pill Remediation Template configuration
57
+
58
+
The Poison Pill Operator also creates the `PoisonPillRemediationTemplate` CR with the name `poison-pill-default-template` in the Poison Pill Operator's namespace. This CR defines the remediation strategy for the nodes.
59
+
60
+
The default remediation strategy is `NodeDeletion` that removes the `node` object.
61
+
In {product-title} 4.10, the Poison Pill Operator introduces a new remediation strategy called `ResourceDeletion`. The `ResourceDeletion` remediation strategy removes the pods and associated volume attachments on the node rather than the `node` object. This strategy helps to recover workloads faster.
62
+
63
+
The `PoisonPillRemediationTemplate` CR resembles the following YAML file:
64
+
65
+
[source,yaml]
66
+
----
67
+
apiVersion: poison-pill.medik8s.io/v1alpha1
68
+
kind: PoisonPillRemediationTemplate
69
+
metadata:
70
+
creationTimestamp: "2022-03-02T08:02:40Z"
71
+
generation: 1
72
+
name: poison-pill-default-template
73
+
namespace: openshift-operators
74
+
resourceVersion: "596469"
75
+
uid: 5d29e437-c485-48fa-ba9e-0354649afd31
76
+
spec:
77
+
template:
78
+
spec:
79
+
remediationStrategy: NodeDeletion <1>
80
+
----
81
+
<1> Specifies the remediation strategy. The default remediation strategy is `NodeDeletion`.
0 commit comments