@@ -818,19 +818,16 @@ No effect.
818
818
819
819
# ##### What are other known failure modes?
820
820
821
- <!--
822
- For each of them, fill in the following information by copying the below template :
823
- - [Failure mode brief description]
824
- - Detection : How can it be detected via metrics? Stated another way:
825
- how can an operator troubleshoot without logging into a master or worker node?
826
- - Mitigations : What can be done to stop the bleeding, especially for already
827
- running user workloads?
828
- - Diagnostics : What are the useful log messages and their required logging
829
- levels that could help debug the issue?
830
- Not required until feature graduated to beta.
831
- - Testing : Are there any tests for failure mode? If not, describe why.
832
- -->
833
-
821
+ - Configuration file cannot be loaded at server start
822
+ - Detection : API server process exits
823
+ - Mitigation : Revert to previous success invocation or configuration
824
+ - Diagnostics : Configuration validation errors are logged at default verbosity.
825
+ - Testing : Configuration file loading and validation is unit tested
826
+ - Configuration file cannot be reloaded while server is running
827
+ - Detection : ` apiserver_authorization_config_controller_automatic_reload_last_timestamp_seconds` metric
828
+ indicates the `failure` status timestamp is most recent.
829
+ - Mitigation : Revert to previous success invocation or configuration
830
+ - Diagnostics : Configuration validation errors are logged at default verbosity.
834
831
835
832
# ##### What steps should be taken if SLOs are not being met to determine the problem?
836
833
0 commit comments