OpenShift Virtualization Storage Management Tools

Tools for managing VM storage relationships and automating storage class migrations at scale.

Quick Start

# 1. Visualize VM storage relationships
./vm-tree.py fedora-vm-with-storage --namespace default

# 2. Find all VMs using a storage class
./vm-tree.py --storage-class standard

# 3. Detect orphaned storage resources
./vm-tree.py --find-orphans

# 4. Plan a storage class migration
./storage-migration.py plan --from-sc standard --to-sc standard-fast -n default

# 5. Execute the migration (dry-run first!)
./storage-migration.py execute --from-sc standard --to-sc standard-fast -n default --dry-run

# 6. Execute actual migration
./storage-migration.py execute --from-sc standard --to-sc standard-fast -n default

# 7. Monitor migration progress in real-time (in a second terminal)
./migration-watch.py -n default --to-sc standard-fast

Motivation

OpenShift Virtualization's storage management presents significant challenges when migrating between storage classes:

  • Manual storage migration: Updating storage classes requires manual cross-correlation of VMs, DataVolumes, PVCs, and PVs
  • Scale issues: For large deployments (40,000 VMs = 80,000+ PVCs), manual tracking is unmanageable
  • Data loss risk: No safeguards or visualization tools to prevent mistakes
  • Leftover resources: No easy way to identify orphaned storage after migration
  • No relationship visualization: Unlike kubectl tree, there is no tool that shows the VM → DV → PVC → PV chain, making it impractical to track how storage resources relate across the cluster.
  • Time and effort: Manually migrating each VM takes ~30 minutes (relationship tracking, workload backup, executing the migration, cleaning up old resources, etc.). At that rate, migrating 40,000+ VMs could take 20,000+ hours. The goal of this tool is to bring that scenario down to ~2 hours (a ~10,000x improvement) by automatically planning, executing, and monitoring the migration.

Tools

vm-tree.py - VM Storage Relationship Visualizer

A tool that visualizes the storage resource hierarchy for OpenShift Virtualization VMs.

Features:

  • Shows complete resource chain: VM → DataVolume → PVC → PV
  • Displays storage class, size, and reclaim policy
  • Warns about dangerous reclaim policies
  • Can show all VMs using a specific storage class
  • Detects orphaned storage resources (DataVolumes, PVCs, PVs not owned by any VM)
  • Works with both oc (OpenShift) and kubectl (vanilla Kubernetes)
  • Colored output for better readability
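
How the chain is resolved: a minimal sketch of the VM → DataVolume → PVC → PV walk is shown below, assuming the standard KubeVirt/CDI field paths (spec.template.spec.volumes[].dataVolume.name, the PVC CDI creates with the same name as the DataVolume, and the PVC's spec.volumeName for the bound PV). It reads resources as JSON via oc and is an illustration of the approach, not vm-tree.py's exact implementation.

#!/usr/bin/env python3
"""Sketch: walk a VM's storage chain (illustrative, not vm-tree.py's exact code)."""
import json
import subprocess
import sys

def get(kind, name, namespace=None):
    # Works with `kubectl` as well; `oc` is assumed here.
    cmd = ["oc", "get", kind, name, "-o", "json"]
    if namespace:
        cmd += ["-n", namespace]
    out = subprocess.run(cmd, capture_output=True, text=True, check=True).stdout
    return json.loads(out)

def walk_vm_storage(vm_name, namespace):
    vm = get("vm", vm_name, namespace)
    print(f"VirtualMachine: {vm_name}")
    for vol in vm["spec"]["template"]["spec"].get("volumes", []):
        dv_name = vol.get("dataVolume", {}).get("name")
        if not dv_name:
            continue  # skip cloudInitNoCloud, containerDisk, etc.
        dv = get("datavolume", dv_name, namespace)
        print(f"  DataVolume: {dv_name} (phase: {dv.get('status', {}).get('phase')})")
        # CDI creates a PVC with the same name as the DataVolume.
        pvc = get("pvc", dv_name, namespace)
        print(f"    PVC: {dv_name} (status: {pvc['status'].get('phase')})")
        pv_name = pvc["spec"].get("volumeName")
        if pv_name:
            pv = get("pv", pv_name)  # PVs are cluster-scoped
            policy = pv["spec"].get("persistentVolumeReclaimPolicy")
            print(f"      PV: {pv_name} (reclaimPolicy: {policy})")
        else:
            print("      PV: (not yet bound)")

if __name__ == "__main__":
    walk_vm_storage(sys.argv[1], sys.argv[2])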

Usage:

# Show storage tree for a specific VM
./vm-tree.py <vm-name> --namespace <namespace-name>

# Show all VMs using a storage class (useful for migration planning)
./vm-tree.py --storage-class <storage-class-name>

# Find orphaned storage resources in a namespace
./vm-tree.py --find-orphans --namespace <namespace-name>

# Find orphaned resources across all namespaces
./vm-tree.py --find-orphans --all-namespaces

Example Output - VM Storage Tree:

$ ./vm-tree.py fedora-vm-with-storage --namespace default
================================================================================
  VM Storage Tree: fedora-vm-with-storage (namespace: default)
================================================================================

VirtualMachine: fedora-vm-with-storage
├─ UID: 86aa276e-2765-4902-bf0f-774f44eeb067
├─ Status: Stopped
│
├─ DataVolumes: (1 found)
│  └─ DataVolume: fedora-dv-inline
      ├─ Phase: WaitForFirstConsumer
      ├─ Size: 5Gi
      ├─ StorageClass: standard
      │
      └─ PersistentVolumeClaim: fedora-dv-inline
         ├─ Status: Pending
         └─ PersistentVolume: (not yet bound)

================================================================================

Example Output - Orphaned Resources:

$ ./vm-tree.py --find-orphans --namespace default
================================================================================
  Orphaned Storage Resources
  Namespace: default
================================================================================

Found 1 orphaned resource(s):

❌ Orphaned DataVolumes: 1
(Not owned by any VirtualMachine)

  • DataVolume: fedora-dv
    ├─ Namespace: default
    ├─ Size: 5Gi
    ├─ StorageClass: standard
    ├─ Phase: WaitForFirstConsumer
    └─ Created: 2026-02-10T18:48:10Z

================================================================================
⚠️  These resources are consuming storage but not used by any VM
⚠️  Consider cleaning up to reclaim storage
================================================================================

This directly addresses the scale issue (e.g., a cluster with 40,000 VMs), where orphaned resources become impossible to track manually. With this tool, wasted storage can be identified across the entire cluster.


storage-migration.py - Automated Storage Class Migration

A tool that automates the migration of VMs from one storage class to another, making storage migration seamless at scale.

Features:

  • Plan Mode: Analyzes impact before migration (dry-run, shows all affected VMs)
  • Execute Mode: Performs the actual migration with optional dry-run
  • Finds all VMs using a specific storage class
  • Calculates total storage to be migrated
  • Creates new DataVolumes on target storage class
  • Tracks migration with labels and timestamps
  • Works with both oc (OpenShift) and kubectl (vanilla Kubernetes)
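
The core mechanic is a CDI clone: create a new DataVolume whose spec.source.pvc points at the existing PVC and whose spec.storage targets the new storage class, then update the VM's volumes to reference the new DataVolume. A minimal sketch of those two steps follows; the labels are modeled on the dry-run output shown later in this README (with the timestamp made label-safe), and the patch strategy is an assumption rather than the tool's verbatim code.

#!/usr/bin/env python3
"""Sketch: one migration step, clone DV + VM patch (assumptions, not the tool's code)."""
import json
import subprocess
import time
from datetime import datetime

def migrate_datavolume(vm_name, dv_name, namespace, from_sc, to_sc, size):
    new_name = f"{dv_name}-migrated-{int(time.time())}"

    # 1. New DataVolume that clones the existing PVC onto the target storage class.
    #    Label values must stay label-safe, so the timestamp uses '.' instead of ':'.
    new_dv = {
        "apiVersion": "cdi.kubevirt.io/v1beta1",
        "kind": "DataVolume",
        "metadata": {
            "name": new_name,
            "namespace": namespace,
            "labels": {
                "storage-migration": "true",
                "migration-timestamp": datetime.now().strftime("%Y-%m-%dT%H.%M.%S"),
                "source-sc": from_sc,
                "target-sc": to_sc,
            },
        },
        "spec": {
            "source": {"pvc": {"namespace": namespace, "name": dv_name}},
            "storage": {
                "storageClassName": to_sc,
                "accessModes": ["ReadWriteOnce"],
                "resources": {"requests": {"storage": size}},
            },
        },
    }
    # `oc apply -f -` accepts a JSON manifest on stdin.
    subprocess.run(["oc", "apply", "-f", "-"], input=json.dumps(new_dv), text=True, check=True)

    # 2. Point the VM's volume at the new DataVolume: read-modify-write of the
    #    volumes list, applied as a merge patch.
    vm = json.loads(subprocess.run(
        ["oc", "get", "vm", vm_name, "-n", namespace, "-o", "json"],
        capture_output=True, text=True, check=True).stdout)
    volumes = vm["spec"]["template"]["spec"]["volumes"]
    for vol in volumes:
        if vol.get("dataVolume", {}).get("name") == dv_name:
            vol["dataVolume"]["name"] = new_name
    patch = {"spec": {"template": {"spec": {"volumes": volumes}}}}
    subprocess.run(["oc", "patch", "vm", vm_name, "-n", namespace,
                    "--type=merge", "-p", json.dumps(patch)], check=True)
    return new_name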

The Value at Scale:

  • For 1 VM: Manual migration takes ~30 minutes
  • For 40,000 VMs: Manual process = 20,000+ hours (impossible)
  • With this tool: Plan once, execute in parallel, track automatically

Usage:

# Plan migration (shows what will be migrated)
./storage-migration.py plan --from-sc <storage-class-name> --to-sc <storage-class-name> -n <namespace-name>

# Execute migration with dry-run first
./storage-migration.py execute --from-sc <storage-class-name> --to-sc <storage-class-name> -n <namespace-name> --dry-run

# Execute actual migration
./storage-migration.py execute --from-sc <storage-class-name> --to-sc <storage-class-name> -n <namespace-name>

# Plan migration across all namespaces
./storage-migration.py plan --from-sc <storage-class-name> --to-sc <storage-class-name>

Example Output - Plan Mode:

$ ./storage-migration.py plan --from-sc standard --to-sc standard-fast -n default
================================================================================
  STORAGE MIGRATION PLAN
================================================================================

  From StorageClass: standard
  To StorageClass:   standard-fast
  Namespace:         default

✅ Target storage class 'standard-fast' exists

Searching for VMs using storage class 'standard'...
Found 10 VM(s) to migrate:

1. VM: test-vm-001 (namespace: default)
   ├─ Status: Stopped
   ├─ DataVolumes to migrate: 1
   └─ DataVolume: test-vm-001-disk
        ├─ Size: 3Gi
        └─ Current StorageClass: standard

2. VM: test-vm-002 (namespace: default)
   ├─ Status: Stopped
   ├─ DataVolumes to migrate: 1
   └─ DataVolume: test-vm-002-disk
        ├─ Size: 3Gi
        └─ Current StorageClass: standard
.
.
.
.

9. VM: test-vm-009 (namespace: default)
   ├─ Status: Stopped
   ├─ DataVolumes to migrate: 1
   └─ DataVolume: test-vm-009-disk
        ├─ Size: 6Gi
        └─ Current StorageClass: standard

10. VM: test-vm-010 (namespace: default)
   ├─ Status: Stopped
   ├─ DataVolumes to migrate: 1
   └─ DataVolume: test-vm-010-disk
        ├─ Size: 6Gi
        └─ Current StorageClass: standard

================================================================================
  MIGRATION SUMMARY
================================================================================
  VMs to migrate:        10
  DataVolumes to clone:  10
  Total storage:         ~52 Gi

  ⚠️  Migration will:
     1. Create new DataVolumes on 'standard-fast'
     2. Clone data from existing DataVolumes
     3. Update VMs to use new DataVolumes
     4. Mark old DataVolumes as orphaned

  ⚠️  Recommended steps:
     1. Stop all VMs before migration (prevents data corruption)
     2. Use --dry-run first to test
     3. Back up critical VMs
     4. Clean up orphaned resources after verification

================================================================================

Example Output - Dry-Run Mode:

$ ./storage-migration.py execute --from-sc standard --to-sc standard-fast -n default --dry-run
================================================================================
  EXECUTING STORAGE MIGRATION
  (DRY RUN MODE - No actual changes will be made)
================================================================================

Migrating 10 VM(s)...

[1/10] Migrating VM: test-vm-001
  Creating new DataVolume: test-vm-001-disk-migrated-1770822362
  [DRY RUN] Would create:

apiVersion: cdi.kubevirt.io/v1beta1
kind: DataVolume
metadata:
  name: test-vm-001-disk-migrated-1770822362
  namespace: default
  labels:
    storage-migration: "true"
    migration-timestamp: "2026-02-11T10:06:02.288228"
    source-sc: "standard"
    target-sc: "standard-fast"
spec:
  source:
    pvc:
      namespace: default
      name: test-vm-001-disk
  storage:
    storageClassName: standard-fast
    accessModes:
      - ReadWriteOnce
    resources:
      requests:
        storage: 3Gi

  Updating VM spec to use new DataVolumes...
  [DRY RUN] Would patch VM to use new DataVolumes

[2/10] Migrating VM: test-vm-002
  Creating new DataVolume: test-vm-002-disk-migrated-1770822362
  [DRY RUN] Would create:

apiVersion: cdi.kubevirt.io/v1beta1
kind: DataVolume
metadata:
  name: test-vm-002-disk-migrated-1770822362
  namespace: default
  labels:
    storage-migration: "true"
    migration-timestamp: "2026-02-11T10:06:02.288336"
    source-sc: "standard"
    target-sc: "standard-fast"
spec:
  source:
    pvc:
      namespace: default
      name: test-vm-002-disk
  storage:
    storageClassName: standard-fast
    accessModes:
      - ReadWriteOnce
    resources:
      requests:
        storage: 3Gi

  Updating VM spec to use new DataVolumes...
  [DRY RUN] Would patch VM to use new DataVolumes
.
.
.
[10/10] Migrating VM: test-vm-010
  Creating new DataVolume: test-vm-010-disk-migrated-1770822362
  [DRY RUN] Would create:

apiVersion: cdi.kubevirt.io/v1beta1
kind: DataVolume
metadata:
  name: test-vm-010-disk-migrated-1770822362
  namespace: default
  labels:
    storage-migration: "true"
    migration-timestamp: "2026-02-11T10:06:02.288560"
    source-sc: "standard"
    target-sc: "standard-fast"
spec:
  source:
    pvc:
      namespace: default
      name: test-vm-010-disk
  storage:
    storageClassName: standard-fast
    accessModes:
      - ReadWriteOnce
    resources:
      requests:
        storage: 6Gi

  Updating VM spec to use new DataVolumes...
  [DRY RUN] Would patch VM to use new DataVolumes

================================================================================
DRY RUN COMPLETE
Run without --dry-run to execute actual migration
================================================================================

Example Output - Execute Migration:

$ ./storage-migration.py execute --from-sc standard --to-sc standard-fast -n default
================================================================================
  EXECUTING STORAGE MIGRATION
================================================================================

Migrating 10 VM(s)...

[1/10] Migrating VM: test-vm-001
  Creating new DataVolume: test-vm-001-disk-migrated-1770826216
  ✅ DataVolume created
  Updating VM spec to use new DataVolumes...
  ✅ VM patched: test-vm-001-disk → test-vm-001-disk-migrated-1770826216

[2/10] Migrating VM: test-vm-002
  Creating new DataVolume: test-vm-002-disk-migrated-1770826217
  ✅ DataVolume created
  Updating VM spec to use new DataVolumes...
  ✅ VM patched: test-vm-002-disk → test-vm-002-disk-migrated-1770826217

[... output truncated for remaining 8 VMs ...]

================================================================================
MIGRATION INITIATED

Monitor progress with:
  kubectl get dv -n default -w

After migration completes:
  1. Verify VMs start successfully
  2. Find orphaned resources: ./vm-tree.py --find-orphans
  3. Clean up old DataVolumes
================================================================================

Finding and Cleaning Up Orphaned Resources:

After migration completes, the old DataVolumes still exist but are no longer actively used by VMs. Use --find-orphans to identify them:

./vm-tree.py --find-orphans

The tool detects orphans by looking for DataVolumes that carry ownerReferences but are no longer referenced in any VM spec, then programmatically correlates them back to their original VMs using Kubernetes metadata relationships.

Example Output:

================================================================================
  Orphaned Storage Resources
  Namespace: default
================================================================================

Found 10 orphaned resource(s):

❌ Orphaned DataVolumes: 10
(Not owned by any VirtualMachine)

  • DataVolume: test-vm-001-disk
    ├─ Namespace: default
    ├─ Size: 3Gi
    ├─ StorageClass: standard
    ├─ Phase: WaitForFirstConsumer
    ├─ Belongs to VM: test-vm-001 (✓✓ very-high confidence)
    │  ├─ Reason: Migration source for: test-vm-001-disk-migrated-1770826216
    │  ├─ VM Status: Provisioning
    │  ├─ Replaced by: test-vm-001-disk-migrated-1770826216
    │  └─ VM's current DVs: test-vm-001-disk-migrated-1770826216
    │
    │  💡 Hint: This is an old disk from a storage migration.
    │      • The VM is now using the migrated disk(s)
    │      • If VM is working correctly, this can be safely deleted
    │      • If kept as backup, consider adding a 'backup' label for tracking
    └─ Created: 2026-02-11T16:09:53Z

  • DataVolume: test-vm-002-disk
    ├─ Namespace: default
    ├─ Size: 3Gi
    ├─ StorageClass: standard
    ├─ Phase: WaitForFirstConsumer
    ├─ Belongs to VM: test-vm-002 (✓✓ very-high confidence)
    │  ├─ Reason: Migration source for: test-vm-002-disk-migrated-1770826217
    │  ├─ VM Status: Provisioning
    │  ├─ Replaced by: test-vm-002-disk-migrated-1770826217
    │  └─ VM's current DVs: test-vm-002-disk-migrated-1770826217
    │
    │  💡 Hint: This is an old disk from a storage migration.
    │      • The VM is now using the migrated disk(s)
    │      • If VM is working correctly, this can be safely deleted
    │      • If kept as backup, consider adding a 'backup' label for tracking
    └─ Created: 2026-02-11T16:09:53Z

[... output truncated for remaining 8 VMs ...]

================================================================================
Cleanup Recommendations:

✓ 10 orphaned DV(s) from storage migrations
  → These were replaced by new DataVolumes on different storage classes
  → Verify VMs are working with new disks, then delete old ones

⚠️  All orphaned resources are consuming storage but not actively used
⚠️  Consider cleaning up to reclaim storage space
================================================================================

How It Works:

The orphan detection uses a programmatic approach (no name pattern guessing):

  1. Build Active DV Set: Extracts DataVolumes actually referenced in each VM's spec.dataVolumeTemplates and spec.template.spec.volumes
  2. Detect Orphans: Finds DataVolumes with ownerReferences but not in the active set (leftover from migration or manual changes)
  3. Trace Migration: Checks if orphaned DV appears in any active DV's spec.source.pvc.name field
  4. Correlate: Uses ownerReferences and source chains to identify which VM the orphaned disk belongs to
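
A condensed sketch of that detection logic, assuming resources are read as JSON via oc (or kubectl), might look like the following; it is illustrative rather than a copy of vm-tree.py:

#!/usr/bin/env python3
"""Condensed sketch of the orphan-detection steps above (illustrative, not vm-tree.py)."""
import json
import subprocess

def get_items(kind, namespace):
    out = subprocess.run(["oc", "get", kind, "-n", namespace, "-o", "json"],
                         capture_output=True, text=True, check=True).stdout
    return json.loads(out)["items"]

def find_orphaned_dvs(namespace):
    vms = get_items("vm", namespace)
    dvs = get_items("datavolume", namespace)

    # 1. Active set: DataVolumes actually referenced by VM specs.
    active = set()
    for vm in vms:
        for tmpl in vm["spec"].get("dataVolumeTemplates", []):
            active.add(tmpl["metadata"]["name"])
        for vol in vm["spec"]["template"]["spec"].get("volumes", []):
            if "dataVolume" in vol:
                active.add(vol["dataVolume"]["name"])

    # 2. Orphans: DVs that exist but are not in the active set (migration leftovers
    #    typically still carry an ownerReference to their VM).
    orphans = [dv for dv in dvs if dv["metadata"]["name"] not in active]

    # 3/4. Trace + correlate: an active DV whose spec.source.pvc.name points at an
    #      orphan marks it as a migration source; ownerReferences name the VM.
    for orphan in orphans:
        name = orphan["metadata"]["name"]
        replaced_by = [d["metadata"]["name"] for d in dvs
                       if d["metadata"]["name"] in active
                       and d["spec"].get("source", {}).get("pvc", {}).get("name") == name]
        owners = [ref["name"] for ref in orphan["metadata"].get("ownerReferences", [])
                  if ref.get("kind") == "VirtualMachine"]
        print(f"{name}: owner VM={owners or 'none'}, replaced by={replaced_by or 'n/a'}")
    return orphans

if __name__ == "__main__":
    find_orphaned_dvs("default")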

Confidence Levels:

  • ✓✓ very-high: Migration source detected (an active DV's spec.source.pvc points at this one) and the ownerReference matches
  • ✓ high: Has an ownerReference to a VM but is not in the VM's spec (manual leftover)
  • ~ medium: Partial correlation (less common)

Use Cases:

  1. Accidental leftover: An admin migrates storage but forgets to clean up the old disks, which stay in the cluster consuming storage on the SAN. The tool identifies these with very-high confidence and suggests safe deletion.

  2. Intentional backup: An admin wants to keep the old disk as a safety copy during the migration (e.g., moving from an old array to a new one). The tool identifies the relationship and suggests adding a backup label for tracking.


migration-watch.py - Real-Time Migration Progress Monitor

A tool that monitors storage migration progress in real-time, providing live status updates and error reporting.

Features:

  • Real-time updates: Auto-refreshes every 5 seconds (configurable)
  • Progress bars: Visual representation of clone progress for each DataVolume
  • Statistics dashboard: Shows total, completed, in-progress, pending, and failed migrations
  • Color-coded status: Green=completed, Cyan=in-progress, Yellow=pending, Red=failed
  • Error reporting: Displays failure reasons and messages
  • Age tracking: Shows how long each migration has been running
  • Namespace filtering: Monitor specific namespace or all namespaces
  • Storage class filtering: Track migrations to a specific target storage class
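
A stripped-down version of such a polling loop is sketched below. It assumes the CDI DataVolume status.phase and status.progress fields and the storage-migration=true label used by the migration manifests above; the rendering is greatly simplified compared to migration-watch.py's actual output.

#!/usr/bin/env python3
"""Stripped-down sketch of a migration watch loop (much simpler than migration-watch.py)."""
import json
import subprocess
import time

def list_migration_dvs(namespace):
    # Select only DataVolumes created by the migration tool, via its label.
    out = subprocess.run(
        ["oc", "get", "dv", "-n", namespace, "-l", "storage-migration=true", "-o", "json"],
        capture_output=True, text=True, check=True).stdout
    return json.loads(out)["items"]

def watch(namespace, refresh=5):
    while True:
        dvs = list_migration_dvs(namespace)
        phases = [dv.get("status", {}).get("phase", "Unknown") for dv in dvs]
        done, failed = phases.count("Succeeded"), phases.count("Failed")
        print(f"\n{time.strftime('%H:%M:%S')}  total={len(dvs)} completed={done} failed={failed}")
        for dv in dvs:
            name = dv["metadata"]["name"]
            phase = dv.get("status", {}).get("phase", "Unknown")
            progress = dv.get("status", {}).get("progress", "N/A")  # e.g. "45.30%"
            print(f"  {name:<45} {phase:<18} {progress}")
        if dvs and done + failed == len(dvs):
            print("All migrations finished.")
            break
        time.sleep(refresh)

if __name__ == "__main__":
    watch("default")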

Usage:

# Watch migrations in default namespace
./migration-watch.py -n default

# Watch migrations to specific storage class
./migration-watch.py -n default --to-sc standard-fast

# Watch across all namespaces
./migration-watch.py --all-namespaces

# Custom refresh interval (10 seconds instead of 5)
./migration-watch.py -n default --refresh 10

Example Output:

================================================================================
  STORAGE MIGRATION PROGRESS
  Namespace: default
  Updated: 2026-02-11 10:23:45
================================================================================

  Summary:
    Total DataVolumes:    10
    ✅ Completed:         3 (30.0%)
    ⏳ In Progress:       5
    ⏸  Pending:          2
    ❌ Failed:           0

  Overall Progress: [============                            ] 30.0%

================================================================================

NAMESPACE    NAME                              PHASE              PROGRESS              AGE
--------------------------------------------------------------------------------------------
default      test-vm-001-disk-migrated-...     Succeeded          [===============] 100%  5m
default      test-vm-002-disk-migrated-...     Succeeded          [===============] 100%  5m
default      test-vm-003-disk-migrated-...     Succeeded          [===============] 100%  4m
default      test-vm-004-disk-migrated-...     CloneInProgress    [=======        ] 45%   4m
default      test-vm-005-disk-migrated-...     CloneInProgress    [====           ] 25%   3m
default      test-vm-006-disk-migrated-...     CloneInProgress    [===            ] 20%   3m
default      test-vm-007-disk-migrated-...     CloneInProgress    [==             ] 15%   2m
default      test-vm-008-disk-migrated-...     CloneInProgress    [=              ] 10%   2m
default      test-vm-009-disk-migrated-...     Pending            ··············· N/A    1m
default      test-vm-010-disk-migrated-...     Pending            ··············· N/A    1m

Refreshing in 5s... (Press Ctrl+C to stop)

Workflow:

Open a second terminal while migration is running:

# Terminal 1: Execute migration
./storage-migration.py execute --from-sc standard --to-sc standard-fast -n default

# Terminal 2: Monitor progress
./migration-watch.py -n default --to-sc standard-fast

Key Benefits:

  • Visibility: See exactly what's happening during large-scale migrations
  • Early detection: Catch failures as they happen, not after the fact
  • Progress estimation: Understand how long migration will take
  • Passive monitoring: No need to manually run kubectl get dv repeatedly

This tool is essential for large-scale migrations, where manual monitoring is impractical. Start it once and monitor passively while migrations run in parallel.


Roadmap

  1. Enhance storage-migration.py:

    • Parallel execution: Migrate N VMs concurrently (configurable); see the sketch after this list
    • Rollback capability: Revert if migration fails
  2. Web UI:

    • Interactive relationship graphs
    • Migration planning dashboard
    • Real-time progress monitoring
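
As a design sketch for the parallel-execution item above (not existing functionality), the per-VM migration step could be fanned out with a thread pool; migrate_vm() below is a hypothetical placeholder for the work storage-migration.py already does for a single VM:

#!/usr/bin/env python3
"""Design sketch for the parallel-execution roadmap item (not existing functionality)."""
from concurrent.futures import ThreadPoolExecutor, as_completed

def migrate_vm(vm, from_sc, to_sc, namespace):
    """Hypothetical placeholder for the per-VM work storage-migration.py does today."""
    raise NotImplementedError

def migrate_vms_in_parallel(vms, from_sc, to_sc, namespace, max_workers=5):
    results = {}
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = {
            pool.submit(migrate_vm, vm, from_sc, to_sc, namespace): vm["metadata"]["name"]
            for vm in vms
        }
        for future in as_completed(futures):
            name = futures[future]
            try:
                results[name] = future.result()
                print(f"✅ {name}: migration initiated")
            except Exception as err:  # one failure should not abort the whole batch
                results[name] = err
                print(f"❌ {name}: {err}")
    return results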

Resources

Contributing

This is a prototype repository. Feedback and improvements welcome!

License

Prototype code - use at your own risk.
