node: 2625: add content for GA graduation

ffromani · ffromani · commit da6db8248c12 · 2025-02-11T10:16:33.000+01:00
File missing content

Signed-off-by: Francesco Romani &lt;fromani@redhat.com&gt;
diff --git a/keps/sig-node/2625-cpumanager-policies-thread-placement/README.md b/keps/sig-node/2625-cpumanager-policies-thread-placement/README.md
@@ -29,6 +29,7 @@
   - [Graduation Criteria of Options](#graduation-criteria-of-options)
     - [Graduation of Options to <code>Beta-quality</code> (non-hidden)](#graduation-of-options-to-beta-quality-non-hidden)
     - [Graduation of Options from <code>Beta-quality</code> to <code>G.A-quality</code>](#graduation-of-options-from-beta-quality-to-ga-quality)
+  - [Removal of the CPUManagerPolicyAlphaOptions and CPUManagerPolicyBetaOptions feature gates](#removal-of-the-cpumanagerpolicyalphaoptions-and-cpumanagerpolicybetaoptions-feature-gates)
   - [Upgrade / Downgrade Strategy](#upgrade--downgrade-strategy)
   - [Version Skew Strategy](#version-skew-strategy)
 - [Production Readiness Review Questionnaire](#production-readiness-review-questionnaire)
@@ -94,7 +95,8 @@ to consider thread-level allocation, to avoid physical CPU sharing and prevent p
 
 ### Non-Goals
 
-TBD
+* Add new cpumanager policies. The community feedback and the conversation we gathered when proposing this KEP were all in favor
+  of adding options to fine-tune the behavior of the static policy rather than adding new policies.
 
 ## Proposal
 
@@ -206,19 +208,23 @@ to implement this enhancement.
 
 ##### Prerequisite testing updates
 
-TBD
-
 ##### Unit tests
 
-- `<package>`: `<date>` - `<test coverage>`
+- `k8s.io/kubernetes/pkg/kubelet/cm/cpumanager`:            `20250130` - 85.6% of statements
+- `k8s.io/kubernetes/pkg/kubelet/cm/cpumanager/state`:      `20250130` - 88.1% of statements
+- `k8s.io/kubernetes/pkg/kubelet/cm/cpumanager/topology`:   `20250130` - 85.0% of statements
 
 ##### Integration tests
 
-- <test>: <link to test coverage>
+- kubelet features don't have usually integration tests. We use a combination of unit tests and `e2e_node` tests.
 
 ##### e2e tests
 
-- <test>: <link to test coverage>
+testgrid:
+- https://testgrid.k8s.io/sig-node-kubelet#kubelet-serial-gce-e2e-cpu-manager
+- https://testgrid.k8s.io/sig-node-kubelet#kubelet-gce-e2e-arm64-ubuntu-serial
+- https://testgrid.k8s.io/sig-node-containerd#pull-e2e-serial-ec2
+- https://testgrid.k8s.io/sig-node-containerd#node-kubelet-containerd-resource-managers
 
 ### Test Plan
 
@@ -248,8 +254,7 @@ NOTE: Even though the feature gate is enabled by default the user still has to e
   The alpha-quality options are hidden by default and only if the `CPUManagerPolicyAlphaOptions` feature gate is enabled the user has the ability to use them.
   The beta-quality options are visible by default, and the feature gate allows a positive acknowledgement that non stable features are being used, and also allows to optionally turn them off.
   Based on the graduation criteria described below, a policy option will graduate from a group to the other (alpha to beta).
-  We plan to removete the `CPUManagerPolicyAlphaOptions` and `CPUManagerPolicyBetaOptions` after all options graduated to stable, after a feature cycle passes without new planned options, and not before 1.28, to give ample time to the work in progress option to graduate at least to beta.
-- Since the feature that allows the ability to customize the behaviour of CPUManager static policy as well as the CPUManager Policy option `full-pcpus-only` were both introduced in 1.22 release and meet the above graduation criterion, `full-pcpus-only` would be considered as a non-hidden option i.e. available to be used when explicitly used along with `CPUManagerPolicyOptions` Kubelet flag in the kubelet config or command line argument called `cpumanager-policy-options` .
+- Since the feature that allows the ability to customize the behaviour of CPUManager static policy as well as the CPUManager Policy option `full-pcpus-only` were both introduced in 1.22 release and meet the above graduation criterion, `full-pcpus-only` would be considered as a non-hidden option i.e. available to be used when explicitly used along with `CPUManagerPolicyOptions` Kubelet flag in the kubelet configuration or command line argument called `cpumanager-policy-options` .
 -  The introduction of this new feature gate gives us the ability to move the feature to beta and later stable without implying all that the options are beta or stable.
 
 The graduation Criteria of options is described below:
@@ -262,6 +267,22 @@ The graduation Criteria of options is described below:
 - [X] Allowing time for feedback (1 year) on the policy option.
 - [X] Risks have been addressed.
 
+### Removal of the CPUManagerPolicyAlphaOptions and CPUManagerPolicyBetaOptions feature gates
+
+This KEP added the `CPUManagerPolicyAlphaOptions` and `CPUManagerPolicyBetaOptions` group feature gates alongside the usual changes required to enable the `full-pcpus-only` option.
+
+We plan to remove the `CPUManagerPolicyAlphaOptions` and `CPUManagerPolicyBetaOptions` after all options graduated to stable.
+We will defer to the last graduating option the additional work to remove the gates. In case of two or more options graduating to GA and thus rendering the gates obsolete, a new minimal KEP should be issued to remove the gates.
+
+The SIG-node community is considering a redesign of the resource management based on NRI and DRA technologies (possibly extended) for the future.
+There were conversation and attempts about this topic since the 1.27 cycle and [past attempts already](https://github.com/kubernetes/enhancements/issues/3675).
+We thus expect a gradual slowdown of additions to cpumanager, including policy options. At time of writing (1.33 cycle) we have 6 policy options at various degree of maturity.
+We expect the rate of proposal of new options to greatly slow down and to stop entirely once the community moves to the future, yet unplanned, resource management architecture.
+
+For the reasons above, we believe it's unlikely we will need to add back the `CPUManagerPolicyAlphaOptions` and `CPUManagerPolicyBetaOptions` feature gates once removed.
+Should new options be proposed and agreed by the community, the recommendation is to graduate using specific feature gates per standard process.
+Considering the expected slowdown, we expect the standard graduation process to be much more manageable in the future, if needed at all.
+
 ### Upgrade / Downgrade Strategy
 
 We expect no impact. The new policies are opt-in and separated by the existing ones.
@@ -317,48 +338,51 @@ Kubelet may fail to start. The kubelet may crash.
 
 ###### What specific metrics should inform a rollback?
 
-The number of pod ending up in Failed for SMTAlignmentError could be used to decide a rollback.
+We can use `cpu_manager_pinning_errors_total` to see all the allocation errors, irrespective of the specific reason though.
+In addition, we can use the logs: the number of pod ending up in Failed for SMTAlignmentError could be used to decide a rollback.
 
 ###### Were upgrade and rollback tested? Was the upgrade->downgrade->upgrade path tested?
 
 Not Applicable.
 
 ###### Is the rollout accompanied by any deprecations and/or removals of features, APIs, fields of API types, flags, etc.?
+
 No.
 
 ### Monitoring requirements
 
 ###### How can an operator determine if the feature is in use by workloads?
 
+- Check the metric `container_aligned_compute_resources_count` with the label `boundary=physical_cpu`
 - Inspect the kubelet configuration of the nodes: check feature gates and usage of the new options
 
 ###### How can someone using this feature know that it is working for their instance?
 
-- [ ] Events
-  - Event Reason:
-- [ ] API .status
-  - Condition name:
-  - Other field:
-- [ ] Other (treat as last resort)
+- [X] Other (treat as last resort)
   - Details:
+    - check metrics and their interplay:
+        * the metric `container_aligned_compute_resources_count` with the label `boundary=physical_cpu`
+        * the metric `cpu_manager_pinning_requests_total`
+        * the metric `cpu_manager_pinning_errors_total`
 
 ###### What are the reasonable SLOs (Service Level Objectives) for the enhancement?
 
 N/A.
 
 ###### What are the SLIs (Service Level Indicators) an operator can use to determine the health of the service?
 
-- [ ] Metrics
+- [X] Metrics
   - Metric name:
-  - [Optional] Aggregation method:
-  - Components exposing the metric:
-- [ ] Other (treat as last resort)
-  - Details:
+    * the metric `container_aligned_compute_resources_count` with the label `boundary=physical_cpu`
+        * the metric `cpu_manager_pinning_requests_total`
+        * the metric `cpu_manager_pinning_errors_total`
+  - Components exposing the metric: kubelet
 
 ###### Are there any missing metrics that would be useful to have to improve observability of this feature?
 
-TBD
-
+We can detail the pinning errors total with a new metric like `cpu_manager_errors_count` or
+`container_aligned_compute_resources_failure_count` using the same labels as we use for `container_aligned_compute_resources_count`.
+These metrics will be added before to graduate to GA.
 
 ### Dependencies
 
@@ -394,7 +418,7 @@ No.
 
 ###### Can enabling / using this feature result in resource exhaustion of some node resources (PIDs, sockets, inodes, etc.)?
 
-TBD
+No.
 
 ### Troubleshooting
 
@@ -404,11 +428,11 @@ No effect.
 
 ###### What are other known failure modes?
 
-No known failure mode. (TBD)
+Allocation failures can lead to workload not going running. The only remediation is to disable the features and restart the kubelets.
 
 ###### What steps should be taken if SLOs are not being met to determine the problem?
 
-N/A (TBD)
+Inspect the metrics and possibly the logs to learn the failure reason
 
 [supported limits]: https://git.k8s.io/community//sig-scalability/configs-and-limits/thresholds.md
 [existing SLIs/SLOs]: https://git.k8s.io/community/sig-scalability/slos/slos.md#kubernetes-slisslos
diff --git a/keps/sig-node/2625-cpumanager-policies-thread-placement/kep.yaml b/keps/sig-node/2625-cpumanager-policies-thread-placement/kep.yaml
@@ -5,7 +5,7 @@ authors:
   - "@swatisehgal"
 owning-sig: sig-node
 participating-sigs: []
-status: implemented
+status: implementable
 creation-date: "2021-04-14"
 last-updated: "2025-01-30"
 reviewers: