Skip to content
Merged
Show file tree
Hide file tree
Changes from 4 commits
Commits
Show all changes
29 commits
Select commit Hold shift + click to select a range
6f81bc1
AI-Driven Alerts for Metrics Anomalies (GA 10/15)
kimsauce Sep 23, 2024
255db55
Merge branch 'main' into DOCS-453
kimsauce Sep 23, 2024
be6a849
more edits
kimsauce Sep 24, 2024
ddb9397
Merge branch 'DOCS-453' of github.com:SumoLogic/sumologic-documentati…
kimsauce Sep 24, 2024
f899351
Update docs/metrics/metrics-anomalies-alerts.md
kimsauce Sep 24, 2024
1ae3741
Merge branch 'main' into DOCS-453
kimsauce Sep 29, 2024
832f8eb
Merge branch 'main' into DOCS-453
kimsauce Sep 29, 2024
326ed64
Merge branch 'DOCS-453' of github.com:SumoLogic/sumologic-documentati…
kimsauce Sep 29, 2024
251cd1d
edits
kimsauce Sep 29, 2024
fd6a092
Update docs/alerts/monitors/create-monitor.md
kimsauce Sep 29, 2024
2d6c1a2
Merge branch 'main' into DOCS-453
kimsauce Oct 1, 2024
20bdd53
Merge branch 'DOCS-453' of github.com:SumoLogic/sumologic-documentati…
kimsauce Oct 1, 2024
820db41
more fixes
kimsauce Oct 2, 2024
23b3d9f
Update docs/alerts/monitors/create-monitor.md
kimsauce Oct 2, 2024
768d61e
Update docs/alerts/monitors/create-monitor.md
kimsauce Oct 2, 2024
ed5721e
Update docs/alerts/monitors/create-monitor.md
kimsauce Oct 2, 2024
5140501
Update docs/alerts/monitors/create-monitor.md
kimsauce Oct 3, 2024
b1dd325
Update 2024-10-15-alerts.md
banant Oct 4, 2024
2b653e0
Merge branch 'main' into DOCS-453
kimsauce Oct 4, 2024
bf89d43
feedback changes
kimsauce Oct 4, 2024
eab3729
Merge branch 'main' into DOCS-453
kimsauce Oct 17, 2024
e20657e
Update docs/alerts/monitors/create-monitor.md
kimsauce Oct 17, 2024
d816812
edits
kimsauce Oct 17, 2024
477f848
Update docs/alerts/monitors/create-monitor.md
kimsauce Oct 17, 2024
5d2c437
Update create-monitor.md
kimsauce Oct 17, 2024
8385eea
Rename 2024-10-15-alerts.md to 2024-10-21-alerts.md
kimsauce Oct 17, 2024
6f47e95
Update 2024-10-21-alerts.md
kimsauce Oct 17, 2024
edad7bd
Merge branch 'main' into DOCS-453
jpipkin1 Oct 22, 2024
baf72ef
Change release note date to Oct 22 2024
jpipkin1 Oct 22, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
31 changes: 31 additions & 0 deletions blog-service/2024-10-15-metrics-alerts.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,31 @@
---
title: AI-Driven Alerts for Metrics Anomalies (Monitors)
image: https://help.sumologic.com/img/sumo-square.png
keywords:
- metrics
- monitors
- alerts
- anomalies
- ai
hide_table_of_contents: true
---

import useBaseUrl from '@docusaurus/useBaseUrl';

<a href="https://help.sumologic.com/release-notes-service/rss.xml"><img src={useBaseUrl('img/release-notes/rss-orange2.png')} alt="icon" width="50"/></a>

We are excited to announce the general availability of AI-driven alerts for metrics anomalies. This release extends our AI-driven alerting capabilities, previously available for logs-based monitors, to metrics-based monitors, enabling you to respond to critical incidents more effectively.

## Benefits

- **Reduces alert fatigue**. AI-driven anomaly detection minimizes irrelevant alerts, allowing customers to focus on critical incidents.
- **Faster incident resolution**. Automated playbooks enable customers to reduce the mean time to resolution by automating repetitive tasks and gathering key diagnostic information.

## Key features

- **Advanced anomaly detection**. A built-in machine learning model analyzes at least 30 days of historical data to establish baseline behavior, detect seasonality, and automatically tune detection parameters. This helps reduce noise from expected spikes while isolating critical issues.
- **Customizable detection framework**. The "Cluster anomalies" detector allows users to specify custom conditions for incident detection, such as marking an anomaly when a defined number of data points within a time window exceed thresholds.
- **Playbook integration**. Customers can automate incident resolution workflows by associating playbooks with metrics monitors. Playbooks streamline responses by gathering diagnostic data, notifying collaborators, and automating recovery tasks.


<!-- learn more link -->
74 changes: 74 additions & 0 deletions docs/metrics/metrics-anomalies-alerts.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,74 @@
---
id: metrics-anomalies-alerts
title: AI-Driven Alerts for Metrics Anomalies
description: Learn about AI-Driven Alerts for metrics-based monitors, which includes advanced anomaly detection and automated incident resolution through playbooks.
---

<!-- Move entire doc to /monitors/alerts docs -->

AI-driven alerts for metrics-based monitors use machine learning to analyze historical data, detect significant deviations, and filter out irrelevant alerts, reducing alert fatigue and allowing teams to focus on critical issues. These capabilities apply to both logs and metrics, providing a comprehensive monitoring solution.

Integrated playbooks automate incident response by gathering diagnostics, notifying teams, triggering recovery actions, streamlining workflows, and improving response times. The customizable "Cluster anomalies" detector allows users to set specific triggers for spiky systems, further reducing false positives and enabling faster issue resolution.

<details>
<summary>What are AI-driven alerts?</summary>

AI-driven alerts use machine learning to analyze historical data, establish baselines, and detect significant deviations in metrics signals. Seasonality detection reduces false positives, and integrated playbooks automate incident response, helping teams resolve issues faster.

</details>

## Key features

### Advanced anomaly detection

- **Machine learning models**. Use 30 days of historical data to establish baseline behavior and detect deviations.
- **Seasonality detection**. Identify recurring patterns, minimizing false positives from periodic spikes.
- **Auto-tuning**. Automatically adjust detection parameters to balance noise and relevance.
- **Extensible detection framework**. Customizable rules, such as "Cluster anomalies," allow for advanced incident detection based on multiple data points exceeding thresholds within a defined time window.

### Playbook integration

Automate responses to anomalies, including diagnostics, team notifications, and recovery tasks like restarting services or scaling infrastructure by linking playbooks to metrics-based monitors.

## Prerequisites

To fully leverage AI-driven alerts for metrics monitors, you'll need:

- **Automation Service**. Required for linking playbooks to metrics-based monitors.
- **Metrics data**. Metrics data must be sent to Sumo Logic for anomaly detection.
- **Metrics aggregation**. Queries that return multiple time series should be aggregated (e.g., using `sum` or `avg` operators) before applying anomaly detection.



## Create a metrics anomaly monitor

1. Go to **Alerts** > **Monitors**, then click **Add** > **New Monitor**.
1. Under **Trigger Conditions**:
* For **Monitor Type**, select **Metrics**.
* For **Detection Method**, select **Anomaly**.
1. Enter your alert conditions, notification settings by going to [Create a Monitor](docs/alerts/monitors/create-monitor.md) and following steps 2 to the end.

## Examples

* **Cluster anomalies detection**. A user configures alerts for anomalies when 5 out of 10 data points in a 10-minute window exceed the baseline, allowing for precision in spiky environments.
* **Automating resolution with playbooks**. A playbook responds to CPU usage anomalies by gathering logs, notifying teams, and restarting affected servers.

## Limitations

Anomaly detection applies to one time series at a time. Multi-time series queries must be aggregated before detection.

## Getting started with playbooks

1. Visit **Automation Service App Central**.
2. Browse over 500 pre-built playbooks.
3. Clone and customize a playbook based on your requirements.

By leveraging pre-built playbooks, you can quickly automate incident responses, reducing time to resolution.

## More information

* [Automation Service](/docs/platform-services/automation-service)
* [Automated Playbooks](/docs/alerts/monitors/use-playbooks-with-monitors/)
* [Create a Monitor](/docs/alerts/monitors/create-monitor)
* [App Central](/docs/platform-services/automation-service/app-central)
* [Metrics Operators](/docs/metrics/metrics-operators)