Commit ad7d5dd

committed
fix: address reviews
1 parent f4be31f commit ad7d5dd

2 files changed, 13 additions and 9 deletions

keps/sig-scheduling/4832-async-preemption/README.md

Lines changed: 8 additions & 6 deletions
@@ -241,15 +241,15 @@ We'll add test cases that multiple pods are trigger preemption.
 **Upgrade**

 During the alpha period, users have to enable the feature gate `SchedulerAsyncPreemption` to opt in this feature.
-This is purely internal feature for kube-scheduler, so no other special actions are required outside the scheduler.
+This is purely in-memory feature for kube-scheduler, so no other special actions are required outside the scheduler.

 **Downgrade**

 Users need to disable the feature gate.

 ### Version Skew Strategy

-This is purely internal feature for kube-scheduler, and hence no version skew strategy.
+This is purely in-memory feature for kube-scheduler, and hence no version skew strategy.

 ## Production Readiness Review Questionnaire

@@ -269,7 +269,7 @@ This is purely internal feature for kube-scheduler, and hence no version skew st

 ###### Does enabling the feature change any default behavior?

-No.
+No. The feature is a performance optimization that affects every Pod that needs preemption, but there are no functional changes: the result of the preemption is the same.
 But, like mentioned in [When kube-apiserver is unstable](#when-kube-apiserver-is-unstable), scheduling results could be different.

 ###### Can the feature be disabled once it has been enabled (i.e. can we roll back the enablement)?
@@ -284,7 +284,7 @@ The scheduler again starts to run PostFilter asynchronously.

 ###### Are there any tests for feature enablement/disablement?

-Given it's purely internal feature and enablement/disablement requires restarting the component (to change the value of feature flag),
+Given it's purely in-memory feature and enablement/disablement requires restarting the component (to change the value of feature flag),
 having feature tests is enough.

 ### Rollout, Upgrade and Rollback Planning
@@ -319,6 +319,8 @@ No.
 ###### How can an operator determine if the feature is in use by workloads?

 This feature is used during all Pods' preemption if the feature gate is enabled.
+You can see if the scheduler triggers any preemptions via `preemption_attempts_total` metric.
+
 You can find Pods that have triggered the preemption by referring to `.Status.NominatedNodeName`,
 and Pods that have been preempted by referring to their condition with `type: DisruptionTarget` and `reason: PreemptionByScheduler`.

@@ -339,8 +341,8 @@ and Pods that have been preempted by referring to their condition with `type: Di

 ###### Are there any missing metrics that would be useful to have to improve observability of this feature?

-- `goroutines_duration_seconds` (w/ label: `operation`): to observe how many preemption goroutines have failed.
-- `goroutines_execution_total` (w/ labels: `operation`, `result`): to observe how long each preemption goroutine takes to complete.
+- `goroutines_duration_seconds` (w/ label: `operation`): to observe how long each preemption goroutine takes to complete.
+- `goroutines_execution_total` (w/ labels: `operation`, `result`): to observe how many preemption goroutines have failed.

 ### Dependencies

keps/sig-scheduling/4832-async-preemption/kep.yaml

Lines changed: 5 additions & 3 deletions
@@ -6,9 +6,9 @@ owning-sig: sig-scheduling
 status: provisional
 creation-date: 2024-09-07
 reviewers:
-- TBD
+- alculquicondor
 approvers:
-- TBD
+- alculquicondor

 see-also:
 - "/keps/sig-scheduling/624-scheduling-framework"
@@ -31,4 +31,6 @@ disable-supported: true

 # The following PRR answers are required at beta release
 metrics:
-- TBD
+- preemption_attempts_total
+- goroutines_duration_seconds
+- goroutines_execution_total
