You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/alerts/monitors/create-monitor.md
+19-19Lines changed: 19 additions & 19 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -69,13 +69,13 @@ When you create a monitor and open the metrics search query in the Metrics Explo
69
69
70
70
## Step 1. Set trigger conditions
71
71
72
-
The first step when creating a new monitor is setting the **Trigger Conditions**. Choose Logs, Metrics, or SLO, enter or select a query, and set thresholds to trigger alerts.
72
+
The first step when creating a new monitor is setting the **Trigger Conditions**.
73
73
74
-
### Monitor type
74
+
### Monitor Type
75
75
76
76
Select a **Monitor Type**, which will create alerts based on [Logs](/docs/search/), [Metrics](/docs/metrics/metrics-queries/), or an [SLO](/docs/observability/reliability-management-slo/).<br/><img src={useBaseUrl('img/alerts/monitors/trigger-conditions-monitor.png')} alt="Monitor types" width="250"/>
77
77
78
-
### Detection method
78
+
### Detection Method
79
79
80
80
Next, select a **Detection Method** (not applicable to SLO monitors).
81
81
@@ -136,15 +136,14 @@ You can set a logs monitor trigger to alert based on the following:
136
136
137
137
Triggers are evaluated by balancing the requirement of timely alert notifications while ensuring that monitor data is indeed available to evaluate trigger conditions.
138
138
139
-
* For static logs monitors, triggers are similar to "Alert when the result is greater than _ within Y Minutes". The triggers are evaluated periodically as below.
139
+
* For [static logs monitors](#static-detection-method), triggers are similar to "Alert when the result is greater than _ within Y Minutes". The triggers are evaluated periodically as below.
140
140
| When detection window (Y) is | Evaluate trigger every |
* For outlier logs monitors, triggers are evaluated every 5 minutes.
147
-
* For anomaly logs monitors, triggers are evaluated every `timeslice` as specified in the monitor query. For example, the below query is evaluated every 2 minutes.
146
+
* For [anomaly logs monitors](#anomaly-detection-method), triggers are evaluated every `timeslice` as specified in the monitor query. For example, the below query is evaluated every 2 minutes.
148
147
```
149
148
_sourceCategory=Labs/Apache/Access
150
149
| timeslice 2m
@@ -154,16 +153,17 @@ Triggers are evaluated by balancing the requirement of timely alert notification
154
153
| sum(successes) as success_cnt, sum(fails) as fail_cnt by _timeslice
155
154
| (fail_cnt/(success_cnt+fail_cnt)) * 100 as failure_rate_pct
156
155
```
156
+
* For [outlier logs monitors](#outlier-detection-method), triggers are evaluated every 5 minutes.
157
157
158
158
When configuring monitor trigger conditions, you can set a resolution window to resolve alerts quickly once the underlying issue is fixed. The resolution window specifies how long a monitor will wait before resolving an alert after the issue is corrected.
159
159
160
160
For example, if your monitor evaluates the last 1 hour, you can set a resolution window of 15 minutes. Once the resolution window is continuously satisfied for 15 minutes, the alert will resolve automatically.<br/><img src={useBaseUrl('img/alerts/monitors/config-resolution-window-2.png')} alt="config-resolution-window" style={{border: '1px solid gray'}} width="700"/>
`Alert when returned row count is <threshold type> <threshold> within <time range>`
169
169
@@ -177,7 +177,7 @@ The recovery condition is set by default to the opposite of the alert condition.
177
177
178
178
For example, if an alert is set to `greater than 10`, the recovery would be set to `less than or equal to 10` when inferred. Sumo Logic automatically resolves the incident when the resolution condition is satisfied.
@@ -219,7 +219,7 @@ Tune the number of anomalous data points detected per day compared to the predic
219
219
220
220
The recovery condition will always be the opposite of the alerting condition. For example, if there is no outlier identified for the duration of the detection window from the time the alert was first fired, then the Monitor will be brought back to the normal state. You cannot customize the resolution condition for the Monitor.
@@ -244,7 +244,7 @@ For example, if your monitor evaluates the last 1 hour, you can set a resolution
244
244
245
245
To fully leverage metrics monitor alerts, you'll need:
246
246
247
-
***Automation Service**. Required for linking playbooks to metrics-based monitors.
247
+
***Automation Service**. Required for linking playbooks to metrics-based monitors ([learn more](#anomaly)).
248
248
***Metrics data**. Our anomaly detection uses up to 30 days of your Sumo Logic metrics data history to establish baseline of the metrics signal and the underlying system behavior.
249
249
***Metrics aggregation**. Queries should be aggregated (for example, using `sum` or `avg` operators) before applying anomaly detection.
250
250
@@ -255,7 +255,7 @@ Examples:
255
255
256
256
#### Static detection method
257
257
258
-
**Metrics - Static - Critical and Warning**
258
+
**Example: Metrics - Static - Critical and Warning**
@@ -287,7 +287,7 @@ The Alert and recovery setting affects both the alert generation logic and the a
287
287
288
288
For example, you want to be alerted when the CPU usage is over 60% `at all times` within a 5-minute window. If you set the count to 3, this means that you will only get an alert if you have at least 3 data points showing CPU usage above 60% within that 5-minute window. If you only have 2 data points, even if both of them show CPU usage above 60%, you won't get an alert.
@@ -304,7 +304,7 @@ For example, you want to be alerted when the CPU usage is over 60% `at all times
304
304
Anomaly detection applies to one time series at a time. All metrics anomaly monitor trigger queries must have aggregation applied at the end of the query before detection.
@@ -315,7 +315,7 @@ Anomaly detection applies to one time series at a time. All metrics anomaly moni
315
315
316
316
Tune the number of anomalous data points detected per day compared to the predicted baseline for the detection window. Select more alerts if you do not want to miss out on most anomalies.
@@ -342,7 +342,7 @@ For recovery, Sumo Logic will automatically resolves the incident when the resol
342
342
343
343
The recovery condition will always be the opposite of the alerting condition. For example, if there is no outlier identified for the duration of the detection window from the time the alert was first fired, then the Monitor will be brought back to the normal state. You cannot customize the resolution condition for the Monitor.
0 commit comments