-
Notifications
You must be signed in to change notification settings - Fork 1.4k
Emergency Runbook
Zack Stickles edited this page Feb 22, 2021
·
11 revisions
This site is hosted on AWS Amplify and is maintained / supported by New Relic's Developer Experience team. If you have any questions or comments, feel free to create an issue or reach out to us at [email protected].
- Troubleshooting dashboard
- #help-dev-experience (for engineering requests)
- #documentation (for content requests)
- Alert policy
- Architecture diagram (TODO: add link when #823 has been completed)
| Scenario | Severity | Resolution |
|---|---|---|
| Site is not loading | ❗ High | Rollback a release |
| Functionality is broken | Rollback a release | |
| Alert has been triggered | Respond to an incident | |
| Copy needs to be adjusted | 👀 Unknown |
Create an issue or ping @hero in #documentation
|
If the site is not loading, or a piece of functionality is broken, you may will likely need to rollback to a stable release using the following steps:
- To do.
The following steps are for on-call engineers working at New Relic:
- Don't panic, you've got this!
- Check to see if there is already an ongoing incident in #emergency-room (or one of the supplemental rooms).
- If there is not an ongoing incident, start one by following the steps in the Incident Commander Runbook.
- Refer to the troubleshooting dashboard to get an idea for what could be going on.
- Look at the recent deployments to production to identify a PR that can be reverted.