Slo prediction experimental #1677

kaushikmitr · 2025-10-03T01:14:58Z

PR #1677 – Add batch prediction capability and lightGBM support to prediction sidecars

Overview

This PR enhances the latency predictor and scheduling pipeline in the Gateway API Inference Extension, introducing batch prediction support, consistent SLO header handling, improved test/deployment flows, and infrastructure updates. Batch predictions (prediction TTFT/TPOT for all pods in a single API call to the sidecars) makes things much more efficient.

Key Changes

Batch Prediction & SLO Headers

Added batch prediction support in the async latency predictor (latencypredictor_async.go) and updated tests.
Normalized all SLO-related HTTP headers to lowercase for consistent handling across clients and proxies.

Prediction Server & Model Support

Added LightGBM as a supported model, with proper runtime dependency installation (libgomp1) to prevent OpenMP errors.
Updated prediction_server.py logic to support multiple models and fallback handling.

Testing & CI/CD

Introduced a dedicated Dockerfile-test that builds a containerized test image running pytest by default.
Extended build-deploy.sh with new commands (test, test-deploy, all, images) to automate build → deploy → test workflows.
Added a Kubernetes batch job manifest (test-dual-server-deployment.yaml) for end-to-end CI-like test execution.

…rediction-experimental

kaushikmitr · 2025-10-03T01:23:36Z

@ahg-g @kfswain @BenjaminBraunDev

kfswain · 2025-10-03T03:26:52Z

/lgtm
/approve

k8s-ci-robot · 2025-10-03T03:27:01Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: kaushikmitr, kfswain

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [kfswain]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

* add latency predictor build readme * update test dual server * allow batch prediction * allow batch prediction, update slo headers to all small

kaushikmitr added 5 commits September 16, 2025 23:34

add latency predictor build readme

308ac4d

update test dual server

2f15be4

Merge branch 'kubernetes-sigs:slo-prediction-experimental' into slo-p…

13f9660

…rediction-experimental

allow batch prediction

676bd37

allow batch prediction, update slo headers to all small

2e808b2

k8s-ci-robot added the cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. label Oct 3, 2025

k8s-ci-robot requested review from elevran and robscott October 3, 2025 01:15

k8s-ci-robot added the size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files. label Oct 3, 2025

k8s-ci-robot assigned kfswain Oct 3, 2025

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Oct 3, 2025

k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Oct 3, 2025

k8s-ci-robot merged commit 0901896 into kubernetes-sigs:slo-prediction-experimental Oct 3, 2025
8 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Slo prediction experimental #1677

Slo prediction experimental #1677

Uh oh!

kaushikmitr commented Oct 3, 2025 •

edited

Loading

Uh oh!

kaushikmitr commented Oct 3, 2025 •

edited

Loading

Uh oh!

kfswain commented Oct 3, 2025

Uh oh!

k8s-ci-robot commented Oct 3, 2025

Uh oh!

Uh oh!

Uh oh!

Slo prediction experimental #1677

Slo prediction experimental #1677

Uh oh!

Conversation

kaushikmitr commented Oct 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR #1677 – Add batch prediction capability and lightGBM support to prediction sidecars

Overview

Key Changes

Batch Prediction & SLO Headers

Prediction Server & Model Support

Testing & CI/CD

Uh oh!

kaushikmitr commented Oct 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kfswain commented Oct 3, 2025

Uh oh!

k8s-ci-robot commented Oct 3, 2025

Uh oh!

Uh oh!

Uh oh!

kaushikmitr commented Oct 3, 2025 •

edited

Loading

kaushikmitr commented Oct 3, 2025 •

edited

Loading