-With the rapid growth of enterprise business, a single Kubernetes cluster often cannot meet the demands of large-scale AI training and inference tasks. Users typically need to manage multiple Kubernetes clusters to achieve unified workload distribution, deployment, and management. Currently, multi-cluster orchestration systems in the industry (such as [Karmada](https://karmada.io/)) primarily target microservices scenarios, providing high availability and disaster recovery deployment capabilities. However, in terms of AI job scheduling, Karmada's capabilities are still limited. It lacks support for **Volcano Job** and cannot meet requirements such as queue management, multi-tenant fair scheduling, and job priority scheduling.
0 commit comments