Skip to content

Commit 8b162d3

Browse files
authored
Merge pull request #4877 from macsko/kep_4247_update_1_32
KEP-4247: Update criteria targeting to enable the QueueingHints by default
2 parents 3dc528a + 4cf5175 commit 8b162d3

File tree

2 files changed

+17
-8
lines changed

2 files changed

+17
-8
lines changed

keps/sig-scheduling/4247-queueinghint/README.md

Lines changed: 15 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -528,7 +528,7 @@ This can inform certain test coverage improvements that we want to do before
528528
extending the production code to implement this enhancement.
529529
-->
530530

531-
- `k8s.io/kubernetes/pkg/scheduler/internal/queue`: `10-01 20:28 JST` - `88.4`
531+
- `k8s.io/kubernetes/pkg/scheduler/internal/queue`: `2024-09-26` - `92.8`
532532

533533
##### Integration tests
534534

@@ -594,15 +594,16 @@ n/a
594594
- The feature gate is implemented. (disabled by default)
595595
- QueueingHint implementation in all plugins.
596596
- The integration tests are implemented for requeueing scenarios in all plugins.
597-
- `PreCheck` feature in the scheduling queue is completely removed.
598-
- No significant degradation in memory comsumption.
599-
- No performance degradation is confirmed via scheduler_perf.
597+
- `PreCheck` feature in the scheduling queue is disabled when SchedulerQueueingHints is enabled.
598+
- No significant degradation in memory comsumption based on `scheduler_inflight_events` metric.
599+
- scheduler_perf covers the performance of most QueueingHintFn for in-tree plugins.
600+
- scheduler_perf runs with QueueingHint both enabled and disabled for all test cases and throughput when enabled is better or, at least, comparable.
601+
- Event handling duration is monitored using scheduler_perf.
600602
- The feature gate is enabled by default.
601-
- No bug report for a while after enabling it by default.
602603

603604
#### GA
604605

605-
- No bug report for a while after reaching Beta.
606+
- No bug report for a while after reaching Beta and enabling it by default.
606607

607608
### Upgrade / Downgrade Strategy
608609

@@ -777,6 +778,10 @@ that might indicate a serious problem?
777778
Maybe something goes wrong with QueueingHint and Pods are stuck in the queue if
778779
- `scheduler_pending_pods` metric with `queue: unschedulable` label grows and keeps high number abnormally
779780
- `pod_scheduling_sli_duration_seconds` metric grows abnormally
781+
Probably inFlightEvents list is not cleaning up properly when
782+
- `scheduler_inflight_events` metric grows abnormally as well as isn't close to 0 when no scheduling is happening
783+
There could be a problem with QueueingHint performance if
784+
- `scheduler_queueing_hint_execution_duration_seconds` and `scheduler_event_handling_duration_seconds` metrics are unexpectedly high
780785

781786
###### Were upgrade and rollback tested? Was the upgrade->downgrade->upgrade path tested?
782787

@@ -864,6 +869,9 @@ Pick one more of these and delete the rest.
864869
- `schedule_attempts_total`
865870
- `scheduling_algorithm_duration_seconds`
866871
- `scheduler_pending_pods` with `queue: unschedulable`
872+
- `scheduler_inflight_events`
873+
- `scheduler_queueing_hint_execution_duration_seconds`
874+
- `scheduler_event_handling_duration_seconds`
867875
- Components exposing the metric: kube-scheduler
868876

869877
###### Are there any missing metrics that would be useful to have to improve observability of this feature?
@@ -1063,6 +1071,7 @@ Major milestones might include:
10631071
- Oct 01, 2023: The initial KEP is submitted.
10641072
- Dec 13, 2023: The feature gate is changed to be disabled by default.
10651073
- Dec 31, 2023: The KEP is updated based on the situation as of v1.30 release cycle. The beta/GA criteria is sorted.
1074+
- Sep 26, 2024: The KEP is updated as QueueingHint is targeting to be enabled by default in the v1.32 release.
10661075

10671076
## Drawbacks
10681077

keps/sig-scheduling/4247-queueinghint/kep.yaml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -18,12 +18,12 @@ see-also:
1818

1919
stage: beta
2020

21-
latest-milestone: "v1.29"
21+
latest-milestone: "v1.32"
2222

2323
milestone:
2424
alpha: "v1.26" # This KEP stems from /keps/sig-node/3063-dynamic-resource-allocation.
2525
beta: "v1.28"
26-
stable: "v1.32"
26+
stable: "v1.34"
2727

2828
feature-gates:
2929
- name: SchedulerQueueingHints

0 commit comments

Comments
 (0)