Alert Behavior Summary

Alert Thresholds

High Severity (Delinquency): 15-minute cooldown
- Critical alerts need faster re-notification
Low Severity (SSH/RPC): 30-minute cooldown
- Infrastructure alerts are less urgent
Cooldowns prevent alert spam during extended outages.
Each validator and node has its own independent cooldown.
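
The cooldown rules above can be sketched as a small tracker. This is a minimal illustration, not the tool's actual code: the names `COOLDOWNS`, `CooldownTracker`, and `should_send` are hypothetical, and keying the cooldown by both target and alert type is an assumption (the summary only says that each validator and node has its own cooldown).

```python
from datetime import datetime, timedelta

# Cooldown durations taken from the summary above; the dict itself is hypothetical.
COOLDOWNS = {
    "high": timedelta(minutes=15),  # delinquency alerts
    "low": timedelta(minutes=30),   # SSH/RPC alerts
}


class CooldownTracker:
    """Remembers when each (target, alert_type) pair last sent an alert."""

    def __init__(self):
        self._last_sent = {}

    def should_send(self, target, alert_type, severity, now):
        """Return True and start a new cooldown if the alert may be sent now."""
        key = (target, alert_type)  # assumption: one cooldown per target and alert type
        last = self._last_sent.get(key)
        if last is not None and now - last < COOLDOWNS[severity]:
            return False  # still inside the cooldown window, suppress
        self._last_sent[key] = now
        return True
```

Because the key includes the target, a suppressed alert for one validator does not delay an alert for another.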

Delinquency alerts fire ONLY when all of the following hold:
- Validator hasn't voted for 30+ seconds AND
- SSH is working (no consecutive failures) AND
- RPC is working (no consecutive failures)
This prevents false delinquency alerts when an infrastructure issue makes the voting status unverifiable.
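
As a rough sketch, the three conditions combine into a single check. The `ValidatorStatus` fields and the function name are hypothetical, and "working" is assumed here to mean a consecutive-failure count of zero.

```python
from dataclasses import dataclass

DELINQUENCY_THRESHOLD_SECONDS = 30  # "30+ seconds" from the summary above


@dataclass
class ValidatorStatus:
    # Hypothetical fields; the real tool may track these differently.
    seconds_since_last_vote: float
    ssh_consecutive_failures: int
    rpc_consecutive_failures: int


def should_alert_delinquency(status: ValidatorStatus) -> bool:
    """True only when the validator is not voting AND both channels are healthy."""
    not_voting = status.seconds_since_last_vote >= DELINQUENCY_THRESHOLD_SECONDS
    ssh_ok = status.ssh_consecutive_failures == 0
    rpc_ok = status.rpc_consecutive_failures == 0
    return not_voting and ssh_ok and rpc_ok
```

If either SSH or RPC is failing, the function returns False and the delinquency alert is withheld.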

SSH fails for 25 minutes, then recovers
- No alert sent (the failure did not reach the 30-minute threshold)
- Timer resets on recovery
RPC fails for 35 minutes continuously
- Alert sent at 30 minutes
- No further RPC alerts for the next 30 minutes (low severity cooldown)
Validator not voting + RPC is down
- No delinquency alert (voting status can't be verified while RPC is down)
- Only an RPC failure alert is sent, once RPC has been failing for 30 minutes
Multiple short failures
- Fail 10 min → Success → Fail 10 min → Success
- No alerts (each failure period < 30 minutes)
- Timer resets after each success
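
The SSH/RPC scenarios above can be reproduced with a small failure timer. This is a sketch under stated assumptions: `FailureTimer` and `alert_minutes` are hypothetical names, checks are assumed to run once per minute, and the cooldown is assumed to persist across a recovery (the summary does not say either way).

```python
from datetime import datetime, timedelta

FAILURE_THRESHOLD = timedelta(minutes=30)      # continuous failure needed before alerting
LOW_SEVERITY_COOLDOWN = timedelta(minutes=30)  # SSH/RPC cooldown


class FailureTimer:
    """Tracks one check (e.g. SSH or RPC) for one node."""

    def __init__(self):
        self.failing_since = None
        self.last_alert = None

    def record(self, ok, now):
        """Record a check result; return True if an alert should be sent."""
        if ok:
            self.failing_since = None  # timer resets on recovery
            return False
        if self.failing_since is None:
            self.failing_since = now
        if now - self.failing_since < FAILURE_THRESHOLD:
            return False  # not failing long enough yet
        if self.last_alert is not None and now - self.last_alert < LOW_SEVERITY_COOLDOWN:
            return False  # inside the cooldown window
        self.last_alert = now
        return True


def alert_minutes(pattern, start=datetime(2024, 1, 1)):
    """Replay (ok, duration_minutes) phases with one check per minute.

    Returns the minute offsets at which an alert fired.
    """
    timer = FailureTimer()
    fired, minute = [], 0
    for ok, duration in pattern:
        for _ in range(duration):
            if timer.record(ok, start + timedelta(minutes=minute)):
                fired.append(minute)
            minute += 1
    return fired
```

For example, `alert_minutes([(False, 25), (True, 1)])` yields no alerts, while `alert_minutes([(False, 35)])` yields a single alert at minute 30.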