Skip to content

Milestones

List view

  • Areas of focus: - Batch inference serving - EPP next-hop proxy exploration - Token in/out support

    No due date
    12/29 issues closed
  • Areas of focus - Implement and document well lit paths for - Multi-model/LoRA - Disaggregated P/D - Data parallel serving - SLO and priorities - Support autoscaler scale from zero - DX improvements - Remove Python dependencies from code base / CICD - Streamline docker builds - Extend integration and end-to-end tests

    Overdue by 1 month(s)
    Due by January 11, 2026
    8/8 issues closed