Deferred ResourceQuota Enforcement for pods with SchedulingReadinessGates #5465

@ichekrygin

Enhancement Description

Current ResourceQuota enforcement is strict: if a pod's resource requests exceed the remaining quota, the pod is rejected during admission and is never created. This is inconsistent with other Kubernetes paradigms, such as node resource availability, where unschedulable pods are allowed to remain in a Pending state.
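As a rough illustration of the strict behavior described above, here is a simplified model, not the actual quota admission code. The names `Quota` and `admit` are hypothetical, and only a single CPU-style resource measured in millicores is considered.

```python
from dataclasses import dataclass


@dataclass
class Quota:
    """Toy model of a ResourceQuota for one resource (CPU, in millicores)."""
    hard: int      # quota limit
    used: int = 0  # usage charged so far


def admit(quota: Quota, request: int) -> bool:
    """Strict admission: reject outright if the request does not fit."""
    if quota.used + request > quota.hard:
        return False  # creation is rejected; no Pending pod is left behind
    quota.used += request
    return True


quota = Quota(hard=1000)
# The first pod fits; the second would exceed the limit and is rejected.
results = [admit(quota, 600), admit(quota, 600)]
print(results, quota.used)
```

In this model the second pod is never stored, so there is nothing to retry once quota frees up. This contrasts with node capacity, where a pod that does not fit stays Pending.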

In addition, quota usage accounting is misaligned with pod readiness state. Pods that are gated by mechanisms like PodSchedulingReadiness cannot yet be scheduled and do not consume actual cluster resources, yet they are still counted against quota usage. This leads to inefficiencies and unnecessary admission rejections, particularly for systems that manage asynchronous or staged workload execution.
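The accounting mismatch can be sketched with a small model. This is an illustration under assumptions, not the proposal's specified design: the `Pod` type, the `charged_usage` helper, and the gate name `example.com/wait` are hypothetical, and excluding gated pods from usage is only one possible reading of "deferred enforcement".

```python
from dataclasses import dataclass, field


@dataclass
class Pod:
    name: str
    cpu_request: int  # millicores
    # Non-empty while the pod is held back from scheduling.
    scheduling_gates: list = field(default_factory=list)


def charged_usage(pods, count_gated=True):
    """Sum CPU requests charged to quota, optionally skipping gated pods."""
    return sum(
        p.cpu_request
        for p in pods
        if count_gated or not p.scheduling_gates
    )


pods = [
    Pod("running", 400),
    Pod("gated", 500, ["example.com/wait"]),  # hypothetical gate name
]

current = charged_usage(pods)                      # gated pod is charged
deferred = charged_usage(pods, count_gated=False)  # assumed alternative
print(current, deferred)
```

With a 1000m limit, the first accounting leaves 100m of headroom even though only 400m can actually be scheduled. The second leaves 600m.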

The strict admission approach:

  • Breaks compatibility with asynchronous or deferred scheduling patterns.
  • Prevents advanced workload orchestrators (e.g., Kueue) from managing workloads that temporarily exceed quota.
  • Impedes resource overcommit strategies even if final scheduling may be feasible after other pods complete.

This proposal introduces a more natural, flexible alternative: deferring ResourceQuota enforcement for pods that are held by scheduling readiness gates.

Please keep this description up to date. This will help the Enhancement Team to track the evolution of the enhancement efficiently.

Metadata

Assignees

No one assigned

    Labels

    sig/apps: Categorizes an issue or PR as relevant to SIG Apps.
    sig/scheduling: Categorizes an issue or PR as relevant to SIG Scheduling.

    Type

    No type

    Projects

    Status

    Needs Triage

    Milestone

    No milestone
