forked from vllm-project/vllm-omni
-
Notifications
You must be signed in to change notification settings - Fork 0
Open
Description
Motivation.
Parent RFC:
vllm-project#984
Proposed Change.
Create the central coordination service for instance discovery
in /vllim_omni/distributed/data_parallel/
1.1.1 Implement OmniCoordinator
1.1.2 Implement OmniCoordClientForStage
1.1.3 Implement OmniCoordClientForHub
1.1.4 Defines exchange messages and enums
Create the LoadBalancer
in /vllim_omni/distributed/load_balancer/
1.2.1 Implement LoadBalancer base class, LoadBalancer.select(*) inputs are the task and the instance list obtained from ClientForHub
1.2.2 Implement RandomBalancer as a conceret subclass of LoadBalancer
Unit Tests
in /tests/distributed/
1.3.1 Unit tests for DPCoordinator test_dp_coordinator.py
1.3.2 Unit tests for LoadBalancer test_load_balancer.py
Design Doc
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels