You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@@ -139,15 +140,15 @@ Items marked with (R) are required *prior to targeting to a milestone / release*
139
140
-[X] (R) Design details are appropriately documented
140
141
-[X] (R) Test plan is in place, giving consideration to SIG Architecture and SIG Testing input (including test refactors)
141
142
-[ ] e2e Tests for all Beta API Operations (endpoints)
142
-
-[] (R) Ensure GA e2e tests for meet requirements for [Conformance Tests](https://github.com/kubernetes/community/blob/master/contributors/devel/sig-architecture/conformance-tests.md)
143
-
-[] (R) Minimum Two Week Window for GA e2e tests to prove flake free
143
+
-[X] (R) Ensure GA e2e tests meet requirements for [Conformance Tests](https://github.com/kubernetes/community/blob/master/contributors/devel/sig-architecture/conformance-tests.md)
144
+
-[X] (R) Minimum Two Week Window for GA e2e tests to prove flake free
144
145
-[X] (R) Graduation criteria is in place
145
-
-[] (R) [all GA Endpoints](https://github.com/kubernetes/community/pull/1806) must be hit by [Conformance Tests](https://github.com/kubernetes/community/blob/master/contributors/devel/sig-architecture/conformance-tests.md)
146
-
-[] (R) Production readiness review completed
147
-
-[] (R) Production readiness review approved
146
+
-[X] (R) [all GA Endpoints](https://github.com/kubernetes/community/pull/1806) must be hit by [Conformance Tests](https://github.com/kubernetes/community/blob/master/contributors/devel/sig-architecture/conformance-tests.md)
147
+
-[X] (R) Production readiness review completed
148
+
-[X] (R) Production readiness review approved
148
149
-[X] "Implementation History" section is up-to-date for milestone
149
150
-[X] User-facing documentation has been created in [kubernetes/website], for publication to [kubernetes.io]
150
-
-[] Supporting documentation—e.g., additional design documents, links to mailing list discussions/SIG meetings, relevant PRs/issues, release notes
151
+
-[X] Supporting documentation—e.g., additional design documents, links to mailing list discussions/SIG meetings, relevant PRs/issues, release notes
151
152
152
153
<!--
153
154
**Note:** This checklist is iterative and should be reviewed and updated every time this enhancement is being considered for a milestone.
@@ -536,6 +537,11 @@ in back-to-back releases.
536
537
* Enable feature gate for e2e pipelines
537
538
* Add e2e tests
538
539
540
+
#### GA
541
+
542
+
* Real-world usage
543
+
* ([The LeaderWorkerSet API (LWS)](https://github.com/kubernetes-sigs/lws/blob/main/docs/setup/install.md))
544
+
539
545
### Upgrade / Downgrade Strategy
540
546
541
547
<!--
@@ -562,15 +568,19 @@ components? What are the guarantees? Make sure this is in the test plan.
562
568
563
569
Consider the following in developing a version skew strategy for this
564
570
enhancement:
565
-
- Does this enhancement involve coordinating behavior in the control plane and
566
-
in the kubelet? How does an n-2 kubelet without this feature available behave
567
-
when this feature is used?
571
+
- Does this enhancement involve coordinating behavior in the control plane and nodes?
572
+
- How does an n-3 kubelet or kube-proxy without this feature available behave when this feature is used?
573
+
- How does an n-1 kube-controller-manager or kube-scheduler without this feature available behave when this feature is used?
568
574
- Will any other components on the node change? For example, changes to CSI,
569
575
CRI or CNI may require updating that component before the kubelet.
570
576
-->
571
577
572
578
There are only `kube-controller-manager` changes involved (in addition to the apiserver changes for dealing with the new StatefulSet field). Node components are not involved so there is no version skew between nodes and the control plane.
573
579
580
+
An n-1 `kube-controller-manager` will have the same effect (when applicable) as rolling back
581
+
to a version where this feature is not enabled. See
582
+
[Rollout, upgrade and rollback planning](#rollout-upgrade-and-rollback-planning) for details.
583
+
574
584
## Production Readiness Review Questionnaire
575
585
576
586
<!--
@@ -729,7 +739,7 @@ Longer term, we may want to require automated upgrade/rollback tests, but we
729
739
are missing a bunch of machinery and tooling and can't do that now.
730
740
-->
731
741
732
-
Manual upgrade->downgrade->upgrade scenario (to be validated):
742
+
A manual upgrade->downgrade->upgrade scenario was performed:
733
743
734
744
- Create a cluster on a version that doesn't use this feature (eg: 1.26)
735
745
- Upgrade a cluster to a version that uses this feature (eg: 1.27)
@@ -784,7 +794,7 @@ and operation of this feature.
784
794
Recall that end users cannot usually observe component logs or access metrics.
785
795
-->
786
796
787
-
-[] Other (treat as last resort)
797
+
-[X] Other (treat as last resort)
788
798
- Details: The user can inspect the pods that are created by the StatefulSet
789
799
which match the StatefulSet's selector.
790
800
@@ -959,6 +969,22 @@ This through this both in small and large cases, again with respect to the
959
969
960
970
No. Resource usage remains the same with this feature.
961
971
972
+
###### Can enabling / using this feature result in resource exhaustion of some node resources (PIDs, sockets, inodes, etc.)?
973
+
974
+
<!--
975
+
Focus not just on happy cases, but primarily on more pathological cases
976
+
(e.g. probes taking a minute instead of milliseconds, failed pods consuming resources, etc.).
977
+
If any of the resources can be exhausted, how this is mitigated with the existing limits
978
+
(e.g. pods per node) or new limits added by this KEP?
979
+
980
+
Are there any tests that were run/should be run to understand performance characteristics better
981
+
and validate the declared limits?
982
+
-->
983
+
984
+
No. This feature runs only on the control plane (StatefulSet controller within `kube-controller-manager`).
985
+
It also doesn't result in any increased node usage, as the number of expected StatefulSet replicas
986
+
remains constant whether this feature is enabled (`.spec.ordinals.start` is set).
987
+
962
988
### Troubleshooting
963
989
964
990
<!--
@@ -1045,7 +1071,8 @@ Major milestones might include:
0 commit comments