Context
PagerDuty has alerted us to a P1 incident.
The incident has the following metadata:
- title: [FIRING:1] At least two servers failed to start in the last 30m projectpythia-binder binderhub 3 (kubeconfig immediate action needed)
- incident id: Q2FNVY88B3QCPT
- service: Server Startup Failure
- type: Base Incident
Details of the incident can be found on PagerDuty.
What we need to do
Follow the incident response process: https://compass.2i2c.org/services/interactive-computing/incidents/
Definition of Done
All steps listed in https://compass.2i2c.org/services/interactive-computing/incidents/process/#steps have been followed.