-
Notifications
You must be signed in to change notification settings - Fork 36
Open
Labels
enhancementNew feature or requestNew feature or requestkind/enhancementCategorizes issue or PR as related to a new feature, enhancement or improvementCategorizes issue or PR as related to a new feature, enhancement or improvement
Description
Summary
Proposal for kubectl-grove, a kubectl plugin that provides a rich interaction layer for Grove workloads on Kubernetes. The CLI bridges the gap between raw kubectl commands and the complex, hierarchical nature of Grove resources (PodCliqueSets, PodGangs, PodCliques, Pods), offering both command-line tools and an interactive Terminal User Interface (TUI) called Arborist.
Motivation
Managing distributed AI/ML workloads with Grove involves understanding complex resource hierarchies and placement topologies. Users currently need to:
- Run multiple
kubectl getcommands to understand the state of their deployment - Manually correlate PodCliqueSets → PodGangs → PodCliques → Pods relationships
- Lack visibility into GPU allocation, topology placement, and fragmentation
- Have no intuitive way to visualize how pods are distributed across racks and nodes
Proposed Features
Critical (Must Have)
- Arborist TUI (
kubectl grove tui) - Hierarchical navigation with real-time refresh and embedded topology view - Topology Command (
kubectl grove topology) - Rack → Node → Pod visualization with GPU allocation bars
High Priority
kubectl grove status- PodCliqueSet status with progress visualizationkubectl grove health- Gang-aware health dashboardkubectl grove diagnostics- Comprehensive diagnostic data collection
Medium Priority
- Lifecycle commands:
rollout,scale,update,restart,apply kubectl grove metrics- Live metrics from pod endpoints
/kind feature
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
enhancementNew feature or requestNew feature or requestkind/enhancementCategorizes issue or PR as related to a new feature, enhancement or improvementCategorizes issue or PR as related to a new feature, enhancement or improvement