KF Serving 2019 Roadmap

Q2 2019

Objective: "Simplify the user experience and provide a low barrier to entry by minimizing the amount of YAML necessary to deploy a trained model."

Objective: "Empower users to safely deploy production models by enabling a variety of deployment strategies."

Model Rollout
- Rollout a model using a blue-green strategy.
- Rollout a model using a pinned strategy.
- Rollout a model using a canary strategy.

Objective: "Reduce the total cost of ownership for models by minimizing the delta between provisioned resources and request load."

Autoscaling
- Scale a model to zero.
- Scale a model from zero without dropping traffic.
- Scale a model that is GPU bound.
- Scale a model that is CPU bound.

Define the API specification (owner ellisbigelow@)
- Explain complete data model
- Document common usage patterns to meet CUJs
Implement the API specification with a CRD (owner yuzisun@)
- Generate a Kubebuilder CRD
- Define golang protos as per spec
- Implement ValidatingAdmissionController for API Validation
- Implement ReconciliationHandler to generate subresources
Integrate a KFServing component with a SeldonDeployment (owner cliveseldon@)
- Determine integration strategy
- Implement integration

TBD