Skip to content

Automated Disaster Recovery (DR) Drill Orchestrator #239

@OtowoSamuel

Description

@OtowoSamuel

🔴 Difficulty: High (200 Points)

DR is only useful if it's tested. We want the operator to automatically run "DR Drills".

✅ Acceptance Criteria

  • Add a drDrillSchedule to the CRD.
  • Periodically trigger a fake failover (by killing the primary or simulating network latency).
  • Measure the Time to Recovery (TTR).
  • Verify the standby successfully took over and the application stayed alive.
  • Generate a report after the drill.

Metadata

Metadata

Assignees

No one assigned

    Labels

    Stellar WaveIssues in the Stellar wave programenhancementNew feature or requestreliabilityReliability and stabilitystellar-waveStellar Wave Program

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions