There are many cases when Node become broken when system pods are failing to run. For example:
- Image pull issues
- Init container failures
- Other Crash Loop Backoff cases
I think introducing a new Node Condition when any of the system namespace pods has failed will be very useful.
Seeking for feedback - are there ever cases when system pods are failing to run and node is not "degraded".
Also, may be a rule like this may be configurable to include additional namespaces to check.
There are many cases when Node become broken when system pods are failing to run. For example:
I think introducing a new Node Condition when any of the system namespace pods has failed will be very useful.
Seeking for feedback - are there ever cases when system pods are failing to run and node is not "degraded".
Also, may be a rule like this may be configurable to include additional namespaces to check.