-
Notifications
You must be signed in to change notification settings - Fork 228
DOCS-453 - Metrics Anomaly option #4577
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Changes from 4 commits
Commits
Show all changes
29 commits
Select commit
Hold shift + click to select a range
6f81bc1
AI-Driven Alerts for Metrics Anomalies (GA 10/15)
kimsauce 255db55
Merge branch 'main' into DOCS-453
kimsauce be6a849
more edits
kimsauce ddb9397
Merge branch 'DOCS-453' of github.com:SumoLogic/sumologic-documentati…
kimsauce f899351
Update docs/metrics/metrics-anomalies-alerts.md
kimsauce 1ae3741
Merge branch 'main' into DOCS-453
kimsauce 832f8eb
Merge branch 'main' into DOCS-453
kimsauce 326ed64
Merge branch 'DOCS-453' of github.com:SumoLogic/sumologic-documentati…
kimsauce 251cd1d
edits
kimsauce fd6a092
Update docs/alerts/monitors/create-monitor.md
kimsauce 2d6c1a2
Merge branch 'main' into DOCS-453
kimsauce 20bdd53
Merge branch 'DOCS-453' of github.com:SumoLogic/sumologic-documentati…
kimsauce 820db41
more fixes
kimsauce 23b3d9f
Update docs/alerts/monitors/create-monitor.md
kimsauce 768d61e
Update docs/alerts/monitors/create-monitor.md
kimsauce ed5721e
Update docs/alerts/monitors/create-monitor.md
kimsauce 5140501
Update docs/alerts/monitors/create-monitor.md
kimsauce b1dd325
Update 2024-10-15-alerts.md
banant 2b653e0
Merge branch 'main' into DOCS-453
kimsauce bf89d43
feedback changes
kimsauce eab3729
Merge branch 'main' into DOCS-453
kimsauce e20657e
Update docs/alerts/monitors/create-monitor.md
kimsauce d816812
edits
kimsauce 477f848
Update docs/alerts/monitors/create-monitor.md
kimsauce 5d2c437
Update create-monitor.md
kimsauce 8385eea
Rename 2024-10-15-alerts.md to 2024-10-21-alerts.md
kimsauce 6f47e95
Update 2024-10-21-alerts.md
kimsauce edad7bd
Merge branch 'main' into DOCS-453
jpipkin1 baf72ef
Change release note date to Oct 22 2024
jpipkin1 File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,31 @@ | ||
| --- | ||
| title: AI-Driven Alerts for Metrics Anomalies (Monitors) | ||
| image: https://help.sumologic.com/img/sumo-square.png | ||
| keywords: | ||
| - metrics | ||
| - monitors | ||
| - alerts | ||
| - anomalies | ||
| - ai | ||
| hide_table_of_contents: true | ||
| --- | ||
|
|
||
| import useBaseUrl from '@docusaurus/useBaseUrl'; | ||
|
|
||
| <a href="https://help.sumologic.com/release-notes-service/rss.xml"><img src={useBaseUrl('img/release-notes/rss-orange2.png')} alt="icon" width="50"/></a> | ||
|
|
||
| We are excited to announce the general availability of AI-driven alerts for metrics anomalies. This release extends our AI-driven alerting capabilities, previously available for logs-based monitors, to metrics-based monitors, enabling you to respond to critical incidents more effectively. | ||
|
|
||
| ## Benefits | ||
|
|
||
| - **Reduces alert fatigue**. AI-driven anomaly detection minimizes irrelevant alerts, allowing customers to focus on critical incidents. | ||
| - **Faster incident resolution**. Automated playbooks enable customers to reduce the mean time to resolution by automating repetitive tasks and gathering key diagnostic information. | ||
|
|
||
| ## Key features | ||
|
|
||
| - **Advanced anomaly detection**. A built-in machine learning model analyzes at least 30 days of historical data to establish baseline behavior, detect seasonality, and automatically tune detection parameters. This helps reduce noise from expected spikes while isolating critical issues. | ||
| - **Customizable detection framework**. The "Cluster anomalies" detector allows users to specify custom conditions for incident detection, such as marking an anomaly when a defined number of data points within a time window exceed thresholds. | ||
| - **Playbook integration**. Customers can automate incident resolution workflows by associating playbooks with metrics monitors. Playbooks streamline responses by gathering diagnostic data, notifying collaborators, and automating recovery tasks. | ||
|
|
||
|
|
||
| <!-- learn more link --> |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,74 @@ | ||
| --- | ||
| id: metrics-anomalies-alerts | ||
| title: AI-Driven Alerts for Metrics Anomalies | ||
| description: Learn about AI-Driven Alerts for metrics-based monitors, which includes advanced anomaly detection and automated incident resolution through playbooks. | ||
| --- | ||
|
|
||
| <!-- Move entire doc to /monitors/alerts docs --> | ||
|
|
||
| AI-driven alerts for metrics-based monitors use machine learning to analyze historical data, detect significant deviations, and filter out irrelevant alerts, reducing alert fatigue and allowing teams to focus on critical issues. These capabilities apply to both logs and metrics, providing a comprehensive monitoring solution. | ||
|
|
||
| Integrated playbooks automate incident response by gathering diagnostics, notifying teams, triggering recovery actions, streamlining workflows, and improving response times. The customizable "Cluster anomalies" detector allows users to set specific triggers for spiky systems, further reducing false positives and enabling faster issue resolution. | ||
|
|
||
| <details> | ||
| <summary>What are AI-driven alerts?</summary> | ||
|
|
||
| AI-driven alerts use machine learning to analyze historical data, establish baselines, and detect significant deviations in metrics signals. Seasonality detection reduces false positives, and integrated playbooks automate incident response, helping teams resolve issues faster. | ||
|
|
||
| </details> | ||
|
|
||
| ## Key features | ||
|
|
||
| ### Advanced anomaly detection | ||
|
|
||
| - **Machine learning models**. Use 30 days of historical data to establish baseline behavior and detect deviations. | ||
| - **Seasonality detection**. Identify recurring patterns, minimizing false positives from periodic spikes. | ||
| - **Auto-tuning**. Automatically adjust detection parameters to balance noise and relevance. | ||
| - **Extensible detection framework**. Customizable rules, such as "Cluster anomalies," allow for advanced incident detection based on multiple data points exceeding thresholds within a defined time window. | ||
|
|
||
| ### Playbook integration | ||
|
|
||
| Automate responses to anomalies, including diagnostics, team notifications, and recovery tasks like restarting services or scaling infrastructure by linking playbooks to metrics-based monitors. | ||
|
|
||
| ## Prerequisites | ||
|
|
||
| To fully leverage AI-driven alerts for metrics monitors, you'll need: | ||
|
|
||
| - **Automation Service**. Required for linking playbooks to metrics-based monitors. | ||
| - **Metrics data**. Metrics data must be sent to Sumo Logic for anomaly detection. | ||
| - **Metrics aggregation**. Queries that return multiple time series should be aggregated (e.g., using `sum` or `avg` operators) before applying anomaly detection. | ||
|
|
||
|
|
||
|
|
||
| ## Create a metrics anomaly monitor | ||
|
|
||
| 1. Go to **Alerts** > **Monitors**, then click **Add** > **New Monitor**. | ||
| 1. Under **Trigger Conditions**: | ||
| * For **Monitor Type**, select **Metrics**. | ||
| * For **Detection Method**, select **Anomaly**. | ||
| 1. Enter your alert conditions, notification settings by going to [Create a Monitor](docs/alerts/monitors/create-monitor.md) and following steps 2 to the end. | ||
|
|
||
| ## Examples | ||
|
|
||
| * **Cluster anomalies detection**. A user configures alerts for anomalies when 5 out of 10 data points in a 10-minute window exceed the baseline, allowing for precision in spiky environments. | ||
| * **Automating resolution with playbooks**. A playbook responds to CPU usage anomalies by gathering logs, notifying teams, and restarting affected servers. | ||
|
|
||
| ## Limitations | ||
|
|
||
| Anomaly detection applies to one time series at a time. Multi-time series queries must be aggregated before detection. | ||
|
|
||
| ## Getting started with playbooks | ||
|
|
||
| 1. Visit **Automation Service App Central**. | ||
| 2. Browse over 500 pre-built playbooks. | ||
| 3. Clone and customize a playbook based on your requirements. | ||
|
|
||
| By leveraging pre-built playbooks, you can quickly automate incident responses, reducing time to resolution. | ||
|
|
||
| ## More information | ||
|
|
||
| * [Automation Service](/docs/platform-services/automation-service) | ||
| * [Automated Playbooks](/docs/alerts/monitors/use-playbooks-with-monitors/) | ||
| * [Create a Monitor](/docs/alerts/monitors/create-monitor) | ||
| * [App Central](/docs/platform-services/automation-service/app-central) | ||
| * [Metrics Operators](/docs/metrics/metrics-operators) | ||
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.