@@ -646,7 +646,7 @@ you need any help or guidance.
646
646
_ This section must be completed when targeting alpha to a release._
647
647
648
648
* ** How can this feature be enabled / disabled in a live cluster?**
649
- - [ ] Feature gate (also fill in values in ` kep.yaml ` )
649
+ - [X ] Feature gate (also fill in values in ` kep.yaml ` )
650
650
- Feature gate name: ` GracefulNodeShutdown `
651
651
- Components depending on the feature gate:
652
652
- ` kubelet `
@@ -696,17 +696,26 @@ _This section must be completed when targeting beta graduation to a release._
696
696
Try to be as paranoid as possible - e.g., what if some components will restart
697
697
mid-rollout?
698
698
699
+ This feature should not impact rollouts.
700
+
699
701
* ** What specific metrics should inform a rollback?**
700
702
703
+ N/A.
704
+
701
705
* ** Were upgrade and rollback tested? Was the upgrade->downgrade->upgrade path tested?**
702
706
Describe manual testing that was done and the outcomes.
703
707
Longer term, we may want to require automated upgrade/rollback tests, but we
704
708
are missing a bunch of machinery and tooling and can't do that now.
705
709
710
+ The feature is part of kubelet config so updating kubelet config should
711
+ enable/disable the feature; upgrade/downgrade is N/A.
712
+
706
713
* ** Is the rollout accompanied by any deprecations and/or removals of features, APIs,
707
714
fields of API types, flags, etc.?**
708
715
Even if applying deprecation policies, they may still surprise some users.
709
716
717
+ No.
718
+
710
719
### Monitoring Requirements
711
720
712
721
_ This section must be completed when targeting beta graduation to a release._
@@ -716,6 +725,8 @@ _This section must be completed when targeting beta graduation to a release._
716
725
checking if there are objects with field X set) may be a last resort. Avoid
717
726
logs or events for this purpose.
718
727
728
+ Check if the feature gate and kubelet config settings are enabled on a node.
729
+
719
730
* ** What are the SLIs (Service Level Indicators) an operator can use to determine
720
731
the health of the service?**
721
732
- [ ] Metrics
@@ -725,6 +736,8 @@ the health of the service?**
725
736
- [ ] Other (treat as last resort)
726
737
- Details:
727
738
739
+ N/A
740
+
728
741
* ** What are the reasonable SLOs (Service Level Objectives) for the above SLIs?**
729
742
At a high level, this usually will be in the form of "high percentile of SLI
730
743
per day <= X". It's impossible to provide comprehensive guidance, but at the very
@@ -734,11 +747,15 @@ the health of the service?**
734
747
job creation time) for cron job <= 10%
735
748
- 99,9% of /health requests per day finish with 200 code
736
749
750
+ N/A.
751
+
737
752
* ** Are there any missing metrics that would be useful to have to improve observability
738
753
of this feature?**
739
754
Describe the metrics themselves and the reasons why they weren't added (e.g., cost,
740
755
implementation difficulties, etc.).
741
756
757
+ N/A.
758
+
742
759
### Dependencies
743
760
744
761
_ This section must be completed when targeting beta graduation to a release._
@@ -757,6 +774,8 @@ _This section must be completed when targeting beta graduation to a release._
757
774
- Impact of its outage on the feature:
758
775
- Impact of its degraded performance or high-error rates on the feature:
759
776
777
+ No, this feature doesn't depend on any specific services running the cluster.
778
+ It only depends on systemd running on the node itself.
760
779
761
780
### Scalability
762
781
@@ -780,27 +799,37 @@ previous answers based on experience in the field._
780
799
- periodic API calls to reconcile state (e.g. periodic fetching state,
781
800
heartbeats, leader election, etc.)
782
801
802
+ No.
803
+
783
804
* ** Will enabling / using this feature result in introducing new API types?**
784
805
Describe them, providing:
785
806
- API type
786
807
- Supported number of objects per cluster
787
808
- Supported number of objects per namespace (for namespace-scoped objects)
788
809
810
+ No.
811
+
789
812
* ** Will enabling / using this feature result in any new calls to the cloud
790
813
provider?**
791
814
815
+ No.
816
+
792
817
* ** Will enabling / using this feature result in increasing size or count of
793
818
the existing API objects?**
794
819
Describe them, providing:
795
820
- API type(s):
796
821
- Estimated increase in size: (e.g., new annotation of size 32B)
797
822
- Estimated amount of new objects: (e.g., new Object X for every existing Pod)
798
823
824
+ No.
825
+
799
826
* ** Will enabling / using this feature result in increasing time taken by any
800
827
operations covered by [ existing SLIs/SLOs] ?**
801
828
Think about adding additional work or introducing new steps in between
802
829
(e.g. need to do X to start a container), etc. Please describe the details.
803
830
831
+ No.
832
+
804
833
* ** Will enabling / using this feature result in non-negligible increase of
805
834
resource usage (CPU, RAM, disk, IO, ...) in any components?**
806
835
Things to keep in mind include: additional in-memory state, additional
@@ -809,6 +838,8 @@ resource usage (CPU, RAM, disk, IO, ...) in any components?**
809
838
This through this both in small and large cases, again with respect to the
810
839
[ supported limits] .
811
840
841
+ No.
842
+
812
843
### Troubleshooting
813
844
814
845
The Troubleshooting section currently serves the ` Playbook ` role. We may consider
@@ -819,6 +850,8 @@ _This section must be completed when targeting beta graduation to a release._
819
850
820
851
* ** How does this feature react if the API server and/or etcd is unavailable?**
821
852
853
+ The feature does not depend on the API server / etcd.
854
+
822
855
* ** What are other known failure modes?**
823
856
For each of them, fill in the following information by copying the below template:
824
857
- [ Failure mode brief description]
@@ -836,6 +869,8 @@ _This section must be completed when targeting beta graduation to a release._
836
869
[ supported limits ] : https://git.k8s.io/community//sig-scalability/configs-and-limits/thresholds.md
837
870
[ existing SLIs/SLOs ] : https://git.k8s.io/community/sig-scalability/slos/slos.md#kubernetes-slisslos
838
871
872
+ N/A.
873
+
839
874
## Implementation History
840
875
841
876
<!--
0 commit comments