Commit befabb8

add triage slo burnrate breaches
1 parent c2b4ca7 commit befabb8

4 files changed: 45 additions and 108 deletions

raw-migrated-files/docs-content/serverless/observability-triage-slo-burn-rate-breaches.md

Lines changed: 0 additions & 51 deletions
This file was deleted.

raw-migrated-files/observability-docs/observability/triage-slo-burn-rate-breaches.md

Lines changed: 0 additions & 50 deletions
This file was deleted.

raw-migrated-files/toc.yml

Lines changed: 0 additions & 2 deletions
@@ -242,7 +242,6 @@ toc:
 - file: docs-content/serverless/observability-plaintext-application-logs.md
 - file: docs-content/serverless/observability-send-application-logs.md
 - file: docs-content/serverless/observability-stream-log-files.md
-- file: docs-content/serverless/observability-triage-slo-burn-rate-breaches.md
 - file: docs-content/serverless/observability-triage-threshold-breaches.md
 - file: docs-content/serverless/observability-view-alerts.md
 - file: docs-content/serverless/observability-view-infrastructure-metrics.md
@@ -499,7 +498,6 @@ toc:
 - file: observability-docs/observability/monitor-status-alert.md
 - file: observability-docs/observability/obs-ai-assistant.md
 - file: observability-docs/observability/observability-get-started.md
-- file: observability-docs/observability/triage-slo-burn-rate-breaches.md
 - file: observability-docs/observability/triage-threshold-breaches.md
 - file: observability-docs/observability/view-infrastructure-metrics.md
 - file: observability-docs/observability/view-observability-alerts.md

solutions/observability/incident-management/triage-slo-burn-rate-breaches.md

Lines changed: 45 additions & 5 deletions
@@ -2,13 +2,53 @@
 mapped_urls:
 - https://www.elastic.co/guide/en/observability/current/triage-slo-burn-rate-breaches.html
 - https://www.elastic.co/guide/en/serverless/current/observability-triage-slo-burn-rate-breaches.html
+
+navigation_title: "SLO burn rate breaches"
 ---
 
-# Triage SLO burn rate breaches
+# Triage SLO burn rate breaches [triage-slo-burn-rate-breaches]
+
+
+SLO burn rate breaches occur when the percentage of bad events over a specified time period exceeds the threshold set in your [SLO burn rate rule](../../../solutions/observability/incident-management/create-an-slo-burn-rate-rule.md). When this happens, you are at risk of exhausting your error budget and violating your SLO.
+
+To triage issues quickly, go to the alert details page:
+
+1. Open the SLO and click **Alerts**.
+2. From the Alerts table, click the ![More actions](../../../images/observability-boxesHorizontal.svg "") icon next to the alert and select **View alert details**.
+
+The alert details page shows information about the alert, including when the alert was triggered, the duration of the alert, the source SLO, and the rule that triggered the alert. You can follow the links to navigate to the source SLO or rule definition.
+
+Explore charts on the page to learn more about the SLO breach:
+
+* **Burn rate chart**. The first chart shows the burn rate during the time range when the alert was active. The line indicates how close the SLO came to breaching the threshold.
+
+:::{image} ../../../images/observability-slo-burn-rate-breach.png
+:alt: Alert details for SLO burn rate breach
+:class: screenshot
+:::
+
+::::{tip}
+The timeline is annotated to show when the threshold was breached. You can hover over an alert icon to see the timestamp of the alert.
+
+::::
+
+* **Alerts history chart**. The next chart provides information about alerts for the same rule and group over the last 30 days. It shows the number of those alerts that were triggered per day, the total number of alerts triggered throughout the 30 days, and the average time it took to recover after a breach.
+
+:::{image} ../../../images/observability-log-threshold-breach-alert-history-chart.png
+:alt: Alert history chart in alert details for SLO burn rate breach
+:class: screenshot
+:::
+
+
+The number, duration, and frequency of these breaches over time give you an indication of how severely the service is degrading so that you can focus on high-severity issues first.
+
+::::{note}
+The contents of the alert details page may vary depending on the type of SLI that’s defined in the SLO.
+::::
 
-% What needs to be done: Align serverless/stateful
 
-% Use migrated content from existing pages that map to this page:
+After investigating the alert, you may want to:
 
-% - [ ] ./raw-migrated-files/observability-docs/observability/triage-slo-burn-rate-breaches.md
-% - [ ] ./raw-migrated-files/docs-content/serverless/observability-triage-slo-burn-rate-breaches.md
+* Click **Snooze the rule** to snooze notifications for a specific time period or indefinitely.
+* Click the ![Actions](../../../images/observability-boxesVertical.svg "") icon and select **Add to case** to add the alert to a new or existing case. To learn more, refer to [Cases](../../../solutions/observability/incident-management/cases.md).
+* Click the ![Actions](../../../images/observability-boxesVertical.svg "") icon and select **Mark as untracked**. When an alert is marked as untracked, actions are no longer generated. You can choose to move active alerts to this state when you disable or delete rules.
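
Reviewer note: the page added in this commit describes a breach as the share of bad events in a window exceeding the rule's threshold. The sketch below illustrates that idea in Python. It assumes the commonly used definition of burn rate as the observed error rate divided by the error rate the SLO allows (1 minus the target). The function names, the sample event counts, and the threshold value are hypothetical and are not taken from this commit or from any Elastic API.

```python
def burn_rate(bad_events: int, total_events: int, slo_target: float) -> float:
    """Return how fast the error budget is being consumed in one window.

    Assumption: burn rate = observed error rate / allowed error rate,
    where allowed error rate = 1 - slo_target. A value of 1.0 means the
    budget would run out exactly at the end of the SLO period.
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be between 0 and 1 (exclusive)")
    if total_events <= 0:
        return 0.0  # no traffic in the window, so no budget was consumed
    error_rate = bad_events / total_events
    error_budget = 1.0 - slo_target
    return error_rate / error_budget


def is_breach(bad_events: int, total_events: int,
              slo_target: float, threshold: float) -> bool:
    """Return True when the window's burn rate exceeds the rule threshold."""
    return burn_rate(bad_events, total_events, slo_target) > threshold


if __name__ == "__main__":
    # Hypothetical window: 500 bad events out of 10,000 against a 99% target.
    rate = burn_rate(500, 10_000, 0.99)
    print(f"burn rate: {rate:.2f}")
    print("breach:", is_breach(500, 10_000, 0.99, threshold=3.0))
```

In this hypothetical window the error rate is 5% against an allowed 1%, so the budget is being consumed roughly five times faster than the SLO permits, which exceeds the example threshold of 3.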
