| 1 | +# KEP-281: Dynamic Kubelet Configuration |
| 2 | + |
| 3 | +<!-- toc --> |
| 4 | +- [Release Signoff Checklist](#release-signoff-checklist) |
| 5 | +- [Summary](#summary) |
| 6 | + - [Risks and Mitigations](#risks-and-mitigations) |
| 7 | + - [Test Plan](#test-plan) |
| 8 | + - [Graduation Criteria](#graduation-criteria) |
| 9 | + - [Upgrade / Downgrade Strategy](#upgrade--downgrade-strategy) |
| 10 | + - [Version Skew Strategy](#version-skew-strategy) |
| 11 | +- [Production Readiness Review Questionnaire](#production-readiness-review-questionnaire) |
| 12 | + - [Feature Enablement and Rollback](#feature-enablement-and-rollback) |
| 13 | + - [Rollout, Upgrade and Rollback Planning](#rollout-upgrade-and-rollback-planning) |
| 14 | + - [Monitoring Requirements](#monitoring-requirements) |
| 15 | + - [Dependencies](#dependencies) |
| 16 | + - [Scalability](#scalability) |
| 17 | + - [Troubleshooting](#troubleshooting) |
| 18 | +- [Implementation History](#implementation-history) |
| 19 | +- [Drawbacks](#drawbacks) |
| 20 | +- [Alternatives](#alternatives) |
| 21 | +- [Infrastructure Needed (Optional)](#infrastructure-needed-optional) |
| 22 | +<!-- /toc --> |
| 23 | + |
| 24 | +## Release Signoff Checklist |
| 25 | + |
| 26 | +Items marked with (R) are required *prior to targeting to a milestone / release*. |
| 27 | + |
| 28 | +- [X] (R) Enhancement issue in release milestone, which links to KEP dir in [kubernetes/enhancements] (not the initial KEP PR) |
| 29 | +- [X] (R) KEP approvers have approved the KEP status as `implementable` |
| 30 | +- [X] (R) Design details are appropriately documented |
| 31 | +- [X] (R) Test plan is in place, giving consideration to SIG Architecture and SIG Testing input |
| 32 | +- [X] (R) Graduation criteria is in place |
| 33 | +- [ ] (R) Production readiness review completed |
| 34 | +- [ ] Production readiness review approved |
| 35 | +- [ ] "Implementation History" section is up-to-date for milestone |
| 36 | +- [ ] User-facing documentation has been created in [kubernetes/website], for publication to [kubernetes.io] |
| 37 | +- [ ] Supporting documentation—e.g., additional design documents, links to mailing list discussions/SIG meetings, relevant PRs/issues, release notes |
| 38 | + |
| 39 | +## Summary |
| 40 | + |
| 41 | +Dynamic Kubelet Configuration allows new Kubelet configurations to be rolled out in a live cluster. |
| 42 | + |
| 43 | +The feature predates the KEP process as it is defined today. For the |
| 44 | +motivation, goals, and design, see the [community repository](https://github.com/kubernetes/community/blob/master/contributors/design-proposals/node/dynamic-kubelet-configuration.md). |
| 45 | + |
| 46 | +### Risks and Mitigations |
| 47 | + |
| 48 | +TODO for the feature deprecation effort |
| 49 | + |
| 50 | +### Test Plan |
| 51 | + |
| 52 | +TODO for the feature deprecation effort |
| 53 | + |
| 54 | +### Graduation Criteria |
| 55 | + |
| 56 | +TODO for the feature deprecation effort |
| 57 | + |
| 58 | +### Upgrade / Downgrade Strategy |
| 59 | + |
| 60 | +TODO for the feature deprecation effort |
| 61 | + |
| 62 | +### Version Skew Strategy |
| 63 | + |
| 64 | +TODO for the feature deprecation effort |
| 65 | + |
| 66 | +## Production Readiness Review Questionnaire |
| 67 | + |
| 68 | +### Feature Enablement and Rollback |
| 69 | + |
| 70 | +_This section must be completed when targeting alpha to a release._ |
| 71 | + |
| 72 | +* **How can this feature be enabled / disabled in a live cluster?** |
| 73 | + |
| 74 | +TODO for the feature deprecation effort |
| 75 | + |
| 76 | +* **Does enabling the feature change any default behavior?** |
| 77 | + |
| 78 | +TODO for the feature deprecation effort |
| 79 | + |
| 80 | +* **Can the feature be disabled once it has been enabled (i.e. can we roll back the enablement)?** |
| 81 | + |
| 82 | +TODO for the feature deprecation effort |
| 83 | + |
| 84 | +* **What happens if we reenable the feature if it was previously rolled back?** |
| 85 | + |
| 86 | +TODO for the feature deprecation effort |
| 87 | + |
| 88 | +* **Are there any tests for feature enablement/disablement?** |
| 89 | + |
| 90 | +TODO for the feature deprecation effort |
| 91 | + |
| 92 | +### Rollout, Upgrade and Rollback Planning |
| 93 | + |
| 94 | +* **How can a rollout fail? Can it impact already running workloads?** |
| 95 | + |
| 96 | +TODO for the feature deprecation effort |
| 97 | + |
| 98 | +* **What specific metrics should inform a rollback?** |
| 99 | + |
| 100 | +TODO for the feature deprecation effort |
| 101 | + |
| 102 | +* **Were upgrade and rollback tested? Was the upgrade->downgrade->upgrade path tested?** |
| 103 | + |
| 104 | +TODO for the feature deprecation effort |
| 105 | + |
| 106 | +### Monitoring Requirements |
| 107 | + |
| 108 | +* **How can an operator determine if the feature is in use by workloads?** |
| 109 | + |
| 110 | +TODO for the feature deprecation effort |
| 111 | + |
| 112 | +* **What are the SLIs (Service Level Indicators) an operator can use to determine the health of the service?** |
| 113 | + |
| 114 | +TODO for the feature deprecation effort |
| 115 | + |
| 116 | +* **What are the reasonable SLOs (Service Level Objectives) for the above SLIs?** |
| 117 | + |
| 118 | +TODO for the feature deprecation effort |
| 119 | + |
| 120 | +* **Are there any missing metrics that would be useful to have to improve observability of this feature?** |
| 121 | + |
| 122 | +TODO for the feature deprecation effort |
| 123 | + |
| 124 | +### Dependencies |
| 125 | + |
| 126 | +* **Does this feature depend on any specific services running in the cluster?** |
| 127 | + |
| 128 | +TODO for the feature deprecation effort |
| 129 | + |
| 130 | +### Scalability |
| 131 | + |
| 132 | +* **Will enabling / using this feature result in any new API calls?** |
| 133 | + |
| 134 | +TODO for the feature deprecation effort |
| 135 | + |
| 136 | +* **Will enabling / using this feature result in introducing new API types?** |
| 137 | + |
| 138 | +TODO for the feature deprecation effort |
| 139 | + |
| 140 | +* **Will enabling / using this feature result in any new calls to the cloud provider?** |
| 141 | + |
| 142 | +TODO for the feature deprecation effort |
| 143 | + |
| 144 | +* **Will enabling / using this feature result in increasing size or count of the existing API objects?** |
| 145 | + |
| 146 | +TODO for the feature deprecation effort |
| 147 | + |
| 148 | +* **Will enabling / using this feature result in increasing time taken by any operations covered by [existing SLIs/SLOs]?** |
| 149 | + |
| 150 | +TODO for the feature deprecation effort |
| 151 | + |
| 152 | +* **Will enabling / using this feature result in non-negligible increase of resource usage (CPU, RAM, disk, IO, ...) in any components?** |
| 153 | + |
| 154 | +TODO for the feature deprecation effort |
| 155 | + |
| 156 | +### Troubleshooting |
| 157 | + |
| 158 | +* **How does this feature react if the API server and/or etcd is unavailable?** |
| 159 | + |
| 160 | +TODO for the feature deprecation effort |
| 161 | + |
| 162 | +* **What are other known failure modes?** |
| 163 | + |
| 164 | +TODO for the feature deprecation effort |
| 165 | + |
| 166 | +* **What steps should be taken if SLOs are not being met to determine the problem?** |
| 167 | + |
| 168 | +TODO for the feature deprecation effort |
| 169 | + |
| 170 | +## Implementation History |
| 171 | + |
| 172 | +- 1.8 Alpha release |
| 173 | +- 1.9 Incremental improvements working towards 1.10 goals |
| 174 | +- 1.11 Beta release |
| 175 | + |
| 176 | +## Drawbacks |
| 177 | + |
| 178 | +TODO for the feature deprecation effort |
| 179 | + |
| 180 | +## Alternatives |
| 181 | + |
| 182 | +TODO for the feature deprecation effort |
| 183 | + |
| 184 | +## Infrastructure Needed (Optional) |
| 185 | + |
| 186 | +TODO for the feature deprecation effort |