Skip to content

Create scale up/down test #347

@elevran

Description

@elevran

End to end test to validate that scaling the model server deployment up/down affects scheduling.

  • start with 1 replica (EPP configuration should be minimal, if any, so it's easier to rationalize how traffic is split)
  • generate traffic going to replica 1
  • add second replica, eventually traffic should be split
  • scale down back to 1 and ensure no errors as a replica is removed from EPP state

Metadata

Metadata

Assignees

Labels

needs-triageIndicates an issue or PR lacks a `triage/foo` label and requires one.

Type

No type

Projects

Status

In progress

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions