You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: keps/sig-network/2079-network-policy-port-range/README.md
+23-19Lines changed: 23 additions & 19 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -199,7 +199,7 @@ validation should be done by CNIs.
199
199
200
200
#### Beta
201
201
-`EndPort` has been supported for at least 1 minor release
202
-
-Four commonly used NetworkPolicy (or CNI providers) implement the new field,
202
+
-Three commonly used NetworkPolicy (or CNI providers) implement the new field,
203
203
with generally positive feedback on its usage.
204
204
- Feature Gate is enabled by Default.
205
205
@@ -220,7 +220,6 @@ start working incorrectly. This is a fail-closed failure, so it is acceptable.
220
220
221
221
### Feature Enablement and Rollback
222
222
223
-
_This section must be completed when targeting alpha to a release._
224
223
225
224
***How can this feature be enabled / disabled in a live cluster?**
226
225
-[X] Feature gate (also fill in values in `kep.yaml`)
@@ -253,7 +252,7 @@ _This section must be completed when targeting alpha to a release._
253
252
254
253
***Are there any tests for feature enablement/disablement?**
255
254
256
-
No - unit tests will be added later.
255
+
Yes and they can be found [here](https://github.com/kubernetes/kubernetes/blob/release-1.21/pkg/registry/networking/networkpolicy/strategy_test.go#L284)
257
256
258
257
### Rollout, Upgrade and Rollback Planning
259
258
@@ -266,7 +265,8 @@ _This section must be completed when targeting beta graduation to a release._
266
265
The increase of 5xx http error count on Network Policies Endpoint
267
266
268
267
***Were upgrade and rollback tested? Was the upgrade->downgrade->upgrade path tested?**
269
-
This will be done.
268
+
Yes, with unit tests.
269
+
There's still some need to make manual tests, that will be done in a follow up.
270
270
271
271
***Is the rollout accompanied by any deprecations and/or removals of features, APIs,
272
272
None
@@ -280,13 +280,25 @@ _This section must be completed when targeting beta graduation to a release._
280
280
an object specifying the range and validating if the traffic is allowed within
281
281
the specified range
282
282
283
+
***How can someone using this feature know that it is working for their instance?
284
+
-[x] Other
285
+
- Details:
286
+
The API Field must be present when a NetworkPolicy is created with that field.
287
+
The feature working correctly depends on the CNI implementation, so the operator can
288
+
look into CNI metrics to check if the rules are being applied correctly, like Calico
289
+
that provides metrics like `felix_iptables_restore_errors` that can be used to
290
+
verify if the amount of restoring errors raised after the feature being applied.
291
+
We might need in a future to add some Status field that allows CNI providers to provide
292
+
feedback about the functionality
293
+
283
294
***What are the SLIs (Service Level Indicators) an operator can use to determine
284
295
the health of the service?**
285
-
Operators would need to monitor the traffic of the Pods to verify if a
286
-
specified port range is applied and allowed in their workloads
296
+
Operators can use metrics provided by the CNI to use as SLI, like
297
+
`felix_iptables_restore_errors` from Calico to verify if the errors rate
298
+
has raised.
287
299
288
300
***What are the reasonable SLOs (Service Level Objectives) for the above SLIs?**
289
-
N/A
301
+
- per-day percentage of API calls finishing with 5XX errors <= 1% is a reasonable SLO
290
302
291
303
***Are there any missing metrics that would be useful to have to improve observability
292
304
of this feature?**
@@ -296,19 +308,11 @@ of this feature?**
296
308
### Dependencies
297
309
298
310
***Does this feature depend on any specific services running in the cluster?**
299
-
No
311
+
Yes, a CNI supporting the new feature
300
312
301
313
302
314
### Scalability
303
315
304
-
_For alpha, this section is encouraged: reviewers should consider these questions
305
-
and attempt to answer them._
306
-
307
-
_For beta, this section is required: reviewers must answer these questions._
308
-
309
-
_For GA, this section is required: approvers should be able to confirm the
310
-
previous answers based on experience in the field._
311
-
312
316
***Will enabling / using this feature result in any new API calls?**
313
317
No
314
318
@@ -337,8 +341,6 @@ resource usage (CPU, RAM, disk, IO, ...) in any components?**
337
341
338
342
### Troubleshooting
339
343
340
-
_This section must be completed when targeting beta graduation to a release._
341
-
342
344
***How does this feature react if the API server and/or etcd is unavailable?**
343
345
As this feature is mainly used by CNI providers, the reaction with API server
344
346
and/or etcd being unavailable will be the same as before.
@@ -347,9 +349,11 @@ _This section must be completed when targeting beta graduation to a release._
347
349
N/A
348
350
349
351
***What steps should be taken if SLOs are not being met to determine the problem?**
350
-
N/A
352
+
Remove EndPort field and check if the number of errors reduce, although this might
353
+
lead to undesired Network Policy, blocking previously working rules.
351
354
352
355
## Implementation History
356
+
- 2021-05-11 Propose Beta graduation and add more Performance Review data
0 commit comments