You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
| [KernelMonitor](https://github.com/kubernetes/node-problem-detector/blob/master/config/kernel-monitor.json) | KernelDeadlock | A system log monitor monitors kernel log and reports problems and metrics according to predefined rules. | disable_system_log_monitor
65
-
| [AbrtAdaptor](https://github.com/kubernetes/node-problem-detector/blob/master/config/abrt-adaptor.json) | None | Monitor ABRT log messages and report them further. ABRT (Automatic Bug Report Tool) is health monitoring daemon able to catch kernel problems as well as application crashes of various kinds occurred on the host. For more information visit the [link](https://github.com/abrt). | disable_system_log_monitor
66
-
| [CustomPluginMonitor](https://github.com/kubernetes/node-problem-detector/blob/master/config/custom-plugin-monitor.json) | On-demand(According to users configuration) | A custom plugin monitor for node-problem-detector to invoke and check various node problems with userdefined check scripts. See proposal [here](https://docs.google.com/document/d/1jK_5YloSYtboj-DtfjmYKxfNnUxCAvohLnsH5aGCAYQ/edit#). | disable_custom_plugin_monitor
67
-
| [SystemStatsMonitor](https://github.com/kubernetes/node-problem-detector/blob/master/config/system-stats-monitor.json) | None(Could be added in the future) | A system stats monitor for node-problem-detector to collect various health-related system stats as metrics. See proposal [here](https://docs.google.com/document/d/1SeaUz6kBavI283Dq8GBpoEUDrHA2a795xtw0OvjM568/edit). | disable_system_stats_monitor
65
+
| [AbrtAdaptor](https://github.com/kubernetes/node-problem-detector/blob/master/config/abrt-adaptor.json) | None | Monitor ABRT log messages and report them further. ABRT (Automatic Bug Report Tool) is a health monitoring daemon able to catch kernel problems as well as application crashes of various kinds that occur on the host. For more information visit the [link](https://github.com/abrt). | disable_system_log_monitor
66
+
| [CustomPluginMonitor](https://github.com/kubernetes/node-problem-detector/blob/master/config/custom-plugin-monitor.json) | On-demand(According to users configuration) | A custom plugin monitor for node-problem-detector to invoke and check various node problems with user-defined check scripts. See the proposal [here](https://docs.google.com/document/d/1jK_5YloSYtboj-DtfjmYKxfNnUxCAvohLnsH5aGCAYQ/edit#). | disable_custom_plugin_monitor
67
+
| [SystemStatsMonitor](https://github.com/kubernetes/node-problem-detector/blob/master/config/system-stats-monitor.json) | None(Could be added in the future) | A system stats monitor for node-problem-detector to collect various health-related system stats as metrics. See the proposal [here](https://docs.google.com/document/d/1SeaUz6kBavI283Dq8GBpoEUDrHA2a795xtw0OvjM568/edit). | disable_system_stats_monitor
68
68
69
69
# Exporter
70
70
71
71
An exporter is a component of node-problem-detector. It reports node problems and/or metrics to
72
-
certain back end. Some of them can be disable at compiletime using a build tag. List of supported exporters:
72
+
certain backends. Some of them can be disabled at compile-time using a build tag. List of supported exporters:
73
73
74
74
| Exporter |Description | Disabling Build Tag |
75
75
|----------|:-----------|:--------------------|
@@ -86,21 +86,21 @@ certain back end. Some of them can be disable at compile time using a build tag.
86
86
87
87
#### For System Log Monitor
88
88
89
-
*`--config.system-log-monitor`: List of paths to system log monitor configuration files, commaseparated, e.g.
89
+
*`--config.system-log-monitor`: List of paths to system log monitor configuration files, comma-separated, e.g.
Refer [heapster docs](https://github.com/kubernetes/heapster/blob/master/docs/source-configuration.md#kubernetes) for a complete list of available options.
119
+
Refer to [heapster docs](https://github.com/kubernetes/heapster/blob/master/docs/source-configuration.md#kubernetes) for a complete list of available options.
120
120
*`--address`: The address to bind the node problem detector server.
121
121
*`--port`: The port to bind the node problem detector server. Use 0 to disable.
122
122
@@ -127,13 +127,13 @@ For example, to run without auth, use the following config:
127
127
128
128
#### For Stackdriver exporter
129
129
130
-
*`--exporter.stackdriver`: Path to a Stackdriver exporter config file, e.g. [config/exporter/stackdriver-exporter.json](https://github.com/kubernetes/node-problem-detector/blob/master/config/exporter/stackdriver-exporter.json), default to empty string. Set to empty string to disable.
130
+
*`--exporter.stackdriver`: Path to a Stackdriver exporter config file, e.g. [config/exporter/stackdriver-exporter.json](https://github.com/kubernetes/node-problem-detector/blob/master/config/exporter/stackdriver-exporter.json), defaults to empty string. Set to empty string to disable.
131
131
132
132
### Deprecated Flags
133
133
134
-
*`--system-log-monitors`: List of paths to system log monitor config files, commaseparated. This option is deprecated, replaced by `--config.system-log-monitor`, and will be removed. NPD will panic if both `--system-log-monitors` and `--config.system-log-monitor` are set.
134
+
*`--system-log-monitors`: List of paths to system log monitor config files, comma-separated. This option is deprecated, replaced by `--config.system-log-monitor`, and will be removed. NPD will panic if both `--system-log-monitors` and `--config.system-log-monitor` are set.
135
135
136
-
*`--custom-plugin-monitors`: List of paths to custom plugin monitor config files, commaseparated. This option is deprecated, replaced by `--config.custom-plugin-monitor`, and will be removed. NPD will panic if both `--custom-plugin-monitors` and `--config.custom-plugin-monitor` are set.
136
+
*`--custom-plugin-monitors`: List of paths to custom plugin monitor config files, comma-separated. This option is deprecated, replaced by `--config.custom-plugin-monitor`, and will be removed. NPD will panic if both `--custom-plugin-monitors` and `--config.custom-plugin-monitor` are set.
137
137
138
138
## Build Image
139
139
@@ -153,19 +153,19 @@ before running `make`. For example:
Above command will compile the node-problem-detector without [Custom Plugin Monitor](https://github.com/kubernetes/node-problem-detector/tree/master/pkg/custompluginmonitor)
156
+
The above command will compile the node-problem-detector without [Custom Plugin Monitor](https://github.com/kubernetes/node-problem-detector/tree/master/pkg/custompluginmonitor)
157
157
and [System Stats Monitor](https://github.com/kubernetes/node-problem-detector/tree/master/pkg/systemstatsmonitor).
158
158
Check out the [Problem Daemon](https://github.com/kubernetes/node-problem-detector#problem-daemon) section
159
159
to see how to disable each problem daemon during compilation time.
160
160
161
161
**Note**:
162
-
By default node-problem-detector will be built with systemd support with `make` command. This requires systemd develop files.
163
-
You should download the systemd develop files first. For Ubuntu, `libsystemd-journal-dev` package should
164
-
be installed. For Debian, `libsystemd-dev` package should be installed.
162
+
By default, node-problem-detector will be built with systemd support with the`make` command. This requires systemd develop files.
163
+
You should download the systemd develop files first. For Ubuntu, the `libsystemd-journal-dev` package should
164
+
be installed. For Debian, the `libsystemd-dev` package should be installed.
165
165
166
166
## Push Image
167
167
168
-
`make push` uploads the docker image to registry. By default, the image will be uploaded to
168
+
`make push` uploads the docker image to a registry. By default, the image will be uploaded to
169
169
`staging-k8s.gcr.io`. It's easy to modify the `Makefile` to push the image
Or alternatively, to install node-problem-detector manually:
181
+
Alternatively, to install node-problem-detector manually:
182
182
183
183
1. Edit [node-problem-detector.yaml](deployment/node-problem-detector.yaml) to fit your environment. Set `log` volume to your system log directory (used by SystemLogMonitor). You can use a ConfigMap to overwrite the `config` directory inside the pod.
184
184
@@ -216,7 +216,7 @@ For example, to test [KernelMonitor](https://github.com/kubernetes/node-problem-
216
216
6.```sudo sh -c "echo 'kernel: INFO: task docker:20744 blocked for more than 120 seconds.' >> /dev/kmsg"```
217
217
7. You can see ```DockerHung``` event and condition in the node-problem-detector log.
218
218
8. You can see ```DockerHung``` condition at [http://127.0.0.1:20256/conditions](http://127.0.0.1:20256/conditions).
219
-
9. You can see diskrelated system metrics in Prometheus format at [http://127.0.0.1:20257/metrics](http://127.0.0.1:20257/metrics).
219
+
9. You can see disk-related system metrics in Prometheus format at [http://127.0.0.1:20257/metrics](http://127.0.0.1:20257/metrics).
220
220
221
221
**Note**:
222
222
- You can see more rule examples under [test/kernel_log_generator/problems](https://github.com/kubernetes/node-problem-detector/tree/master/test/kernel_log_generator/problems).
@@ -258,17 +258,17 @@ Kubernetes cluster to a healthy state. The following remedy systems exist:
258
258
259
259
NPD is tested via unit tests, [NPD e2e tests](https://github.com/kubernetes/node-problem-detector/blob/master/test/e2e/README.md), Kubernetes e2e tests and Kubernetes nodes e2e tests. Prow handles the [pre-submit tests](https://github.com/kubernetes/test-infra/blob/master/config/jobs/kubernetes/node-problem-detector/node-problem-detector-presubmits.yaml) and [CI tests](https://github.com/kubernetes/test-infra/blob/master/config/jobs/kubernetes/node-problem-detector/node-problem-detector-ci.yaml).
See [NPD e2e test documentation](https://github.com/kubernetes/node-problem-detector/blob/master/test/e2e/README.md) for how to setup and run NPD e2e tests.
271
+
See [NPD e2e test documentation](https://github.com/kubernetes/node-problem-detector/blob/master/test/e2e/README.md) for how to set up and run NPD e2e tests.
0 commit comments