You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
1. Recreate the node problem detector with the new configuration file:
87
+
1. Recreate the Node Problem Detector with the new configuration file:
88
88
89
89
```shell
90
90
# If you have a node-problem-detector running, delete before recreating
@@ -93,25 +93,25 @@ to overwrite the configuration:
93
93
```
94
94
95
95
{{< note >}}
96
-
This approach only applies to a node problem detector started with `kubectl`.
96
+
This approach only applies to a Node Problem Detector started with `kubectl`.
97
97
{{< /note >}}
98
98
99
-
Overwriting a configuration is not supported if a node problem detector runs as a cluster Addon.
99
+
Overwriting a configuration is not supported if a Node Problem Detector runs as a cluster Addon.
100
100
The Addon manager does not support `ConfigMap`.
101
101
102
102
## Kernel Monitor
103
103
104
-
*Kernel Monitor* is a system log monitor daemon supported in the node problem detector.
104
+
*Kernel Monitor* is a system log monitor daemon supported in the Node Problem Detector.
105
105
Kernel monitor watches the kernel log and detects known kernel issues following predefined rules.
106
106
107
107
The Kernel Monitor matches kernel issues according to a set of predefined rule list in
108
-
[`config/kernel-monitor.json`](https://github.com/kubernetes/node-problem-detector/blob/v0.1/config/kernel-monitor.json). The rule list is extensible. You can extend the rule list by overwriting the
108
+
[`config/kernel-monitor.json`](https://github.com/kubernetes/node-problem-detector/blob/v0.1/config/kernel-monitor.json). The rule list is extensible. You can expand the rule list by overwriting the
109
109
configuration.
110
110
111
111
### Add new NodeConditions
112
112
113
-
To support a new `NodeCondition`, you can extend the `conditions` field in
114
-
`config/kernel-monitor.json` with a new condition definition such as:
113
+
To support a new `NodeCondition`, create a condition definition within the `conditions` field in
114
+
`config/kernel-monitor.json`, for example:
115
115
116
116
```json
117
117
{
@@ -140,7 +140,7 @@ with a new rule definition:
140
140
Check your kernel log path location in your operating system (OS) distribution.
141
141
The Linux kernel [log device](https://www.kernel.org/doc/Documentation/ABI/testing/dev-kmsg) is usually presented as `/dev/kmsg`. However, the log path location varies by OS distribution.
142
142
The `log` field in `config/kernel-monitor.json` represents the log path inside the container.
143
-
You can configure the `log` field to match the device path as seen by the node problem detector.
143
+
You can configure the `log` field to match the device path as seen by the Node Problem Detector.
144
144
145
145
### Add support for another log format {#support-other-log-format}
146
146
@@ -152,11 +152,11 @@ You can implement a new translator for a new log format.
152
152
153
153
## Recommendations and restrictions
154
154
155
-
It is recommended to run the node problem detector in your cluster to monitor node health.
156
-
When running the node problem detector, you can expect extra resource overhead on each node.
155
+
It is recommended to run the Node Problem Detector in your cluster to monitor node health.
156
+
When running the Node Problem Detector, you can expect extra resource overhead on each node.
157
157
Usually this is fine, because:
158
158
159
159
* The kernel log grows relatively slowly.
160
-
* A resource limit is set for the node problem detector.
161
-
* Even under high load, the resource usage is acceptable. For more information, see the node problem detector
160
+
* A resource limit is set for the Node Problem Detector.
161
+
* Even under high load, the resource usage is acceptable. For more information, see the Node Problem Detector
0 commit comments