You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
- The templates for alerts described here are defined generically for settings like Severity, Aggregation Granularity, Frequency of Evaluation, Condition Type, and so on. You can modify the settings after deployment to meet your needs. See [detailed information about configuring a metric alert rule](../azure-monitor/alerts/alerts-create-new-alert-rule.md) for more information.
31
-
- The templates for metric-based alerts use the **Dynamic threshold** value with [high sensitivity](../azure-monitor/alerts/alerts-dynamic-thresholds.md#dynamic-threshold-sensitivity). You can choose to adjust these settings based on your needs.
31
+
- The templates for metric-based alerts use the **Dynamic threshold** value with [high sensitivity](../azure-monitor/alerts/alerts-dynamic-thresholds.md#known-issues-with-dynamic-threshold-sensitivity). You can choose to adjust these settings based on your needs.
Copy file name to clipboardExpand all lines: articles/azure-monitor/alerts/alerts-create-new-alert-rule.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -102,7 +102,7 @@ Then you define these elements for the resulting alert actions by using:
102
102
|Field |Description |
103
103
|---------|---------|
104
104
|Check every|Select how often the alert rule checks if the condition is met. |
105
-
|Lookback period|Select the how far back to look each time the data is checked. For example, every 1 minute you’ll be looking at the past 5 minutes.|
105
+
|Lookback period|Select how far back to look each time the data is checked. For example, every 1 minute you’ll be looking at the past 5 minutes.|
106
106
107
107
1. (Optional) In the **Advanced options** section, you can specify how many failures within a specific time period will trigger the alert. For example, you can specify that you only want to trigger an alert if there were three failures in the last hour. This setting is defined by your application business policy.
Copy file name to clipboardExpand all lines: articles/azure-monitor/alerts/alerts-dynamic-thresholds.md
+29-34Lines changed: 29 additions & 34 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -12,39 +12,37 @@ ms.date: 2/23/2022
12
12
13
13
Dynamic thresholds apply advanced machine learning, and use a set of algorithms and methods to:
14
14
- Learn the historical behavior of metrics
15
-
- Analyze metrics over time and identify patterns such as hourly, daily or weekly patterns.
15
+
- Analyze metrics over time and identify patterns such as hourly, daily or weekly patterns
16
16
- Recognize anomalies that indicate possible service issues
17
17
- Calculate the most appropriate threshold for the metric
18
18
19
19
When you use dynamic thresholds, you don't have to know the "right" threshold for each metric, because dynamic thresholds calculate the most appropriate thresholds for you.
20
20
21
21
Dynamic thresholds help you:
22
22
- Create scalable alerts for hundreds of metric series with one alert rule. If you have fewer alert rules, you spend less time creating and managing alerts rules. Scalable alerting is especially useful for multiple dimensions or for multiple resources, such as to all resources in a subscription.
23
-
- Create rules without having to know what threshold to configure. Using dynamic thresholds, you can configure metric alerts using high-level concepts, without having extensive domain knowledge about the metric.
23
+
- Create rules without having to know what threshold to configure. Using dynamic thresholds, you can configure metric alerts using high-level concepts, without having extensive domain knowledge about the metric
24
24
- Configure up metric alerts using high-level concepts without extensive domain knowledge about the metric
25
25
- Prevent noisy (low precision) or wide (low recall) thresholds that don’t have an expected pattern
26
-
- Handle noisy metrics (such as machine CPU or memory) and metrics with low dispersion (such as availability and error rate).
26
+
- Handle noisy metrics (such as machine CPU or memory) and metrics with low dispersion (such as availability and error rate)
27
27
28
-
Dynamic Thresholds can be applied to:
28
+
You can use dynamic thresholds on:
29
29
- most Azure Monitor platform and custom metrics
30
30
- common application and infrastructure metrics
31
31
- noisy metrics, such as machine CPU or memory
32
-
- metrics with low dispersion, such as availability and error rate.
32
+
- metrics with low dispersion, such as availability and error rate
33
33
See [metrics not supported by dynamic thresholds](#metrics-not-supported-by-dynamic-thresholds) for a list of metrics that aren't supported by dynamic thresholds.
34
34
35
-
The system automatically recognizes prolonged outages and removes them from the threshold learning algorithm. If there is a prolonged outage, dynamic thresholds understand the data, and detect system issues with the same level of sensitivity as before the outage occurred.
35
+
The system automatically recognizes prolonged outages and removes them from the threshold learning algorithm. If there's a prolonged outage, dynamic thresholds understand the data, and detect system issues with the same level of sensitivity as before the outage occurred.
When an alert rule is first created, dynamic thresholds use 10 days of historical data to calculate hourly or daily seasonal patterns. The chart that you see in the alert preview reflects that data. After an alert rule is created, dynamic thresholds continually use all available historical data to learn, and adjust the thresholds to be more accurate. After three weeks, dynamic thresholds have enough data to identify weekly patterns as well, and the model is adjusted to include weekly seasonality.
45
44
Alert rules that use dynamic thresholds don't trigger an alert before collecting three days and at least 30 samples of metric data.
46
-
47
-
## How to understand dynamic thresholds charts
45
+
## Understand dynamic thresholds charts
48
46
49
47
The following chart shows a metric, its dynamic thresholds limits, and some alerts that fired when the value was outside the allowed thresholds.
50
48
@@ -61,13 +59,30 @@ Use the following information to interpret the chart:
61
59
-**Red area**: Shows the time when the metric value was outside of the allowed range. The alert remains in the active state as long as subsequent measured values are out of the allowed range, but no new alerts are fired.
62
60
-**End of red area**: When the blue line is back inside the allowed values, the red area stops and the measured value line turns blue. The status of the metric alert fired at the time of the red dot with black outline is set to resolved.
63
61
64
-
## Limitations of dynamic thresholds
62
+
## Considerations when using dynamic thresholds
65
63
66
64
- To ensure accurate threshold calculation, alerts using dynamic thresholds don't trigger an alert before collecting three days and at least 30 samples of metric data. Therefore, new resources or resources missing metric data doesn't trigger an alert until enough data is available.
67
65
- Dynamic thresholds need at least three weeks of historical data to detect weekly seasonality. Some detailed patterns, such as bi-hourly or semi-weekly patterns may not be detected.
68
66
- If the behavior of a metric changed recently, the changes won't be immediately reflected in the dynamic threshold upper and lower bounds. The borders are calculated based on metric data from the last 10 days. When you view the Dynamic Threshold borders for a given metric, look at the metric trend in the last week and not only for recent hours or days.
69
67
- Dynamic thresholds are good for detecting significant deviations, as opposed to slowly evolving issues. Slow behavior changes will probably not trigger an alert.
70
68
69
+
## Known issues with dynamic threshold sensitivity
70
+
71
+
- If an alert rule that uses dynamic thresholds is too noisy or fires too much, you may need to reduce the sensitivity of your dynamic thresholds alert rule. Use one of the following options:
72
+
-**Threshold sensitivity:** Set the sensitivity to **Low** to be more tolerant for deviations.
73
+
-**Number of violations (under Advanced settings):** Configure the alert rule to trigger only if several deviations occur within a certain period of time. This setting makes the rule less susceptible to transient deviations.
74
+
75
+
- You may encounter an alert rule that uses dynamic thresholds doesn't fire or isn't sensitive enough, even though it's configured with high sensitivity. This can happen when the metric's distribution is highly irregular. Consider one of the following solutions to fix the issue:
76
+
- Move to monitoring a complementary metric that's suitable for your scenario, if applicable. For example, check for changes in success rate rather than failure rate.
77
+
- Try selecting a different value for **Aggregation granularity (Period)**.
78
+
- Check if there has been a drastic change in the metric behavior in the last 10 days, such as an outage. An abrupt change can affect the upper and lower thresholds calculated for the metric and make them broader. Wait a few days until the outage is no longer taken into the thresholds calculation. You can also edit the alert rule to use the **Ignore data before** option in the **Advanced settings**.
79
+
- If your data has weekly seasonality, but not enough history is available for the metric, the calculated thresholds can result in having broad upper and lower bounds. For example, the calculation can treat weekdays and weekends in the same way and build wide borders that don't always fit the data. This issue should resolve itself after enough metric history is available. Then, the correct seasonality is detected and the calculated thresholds update accordingly.
80
+
81
+
- When a metric value exhibits large fluctuations, dynamic thresholds may build a wide model around the metric values, which can result in a lower or higher boundary than expected. This scenario can happen when:
82
+
- The sensitivity is set to low.
83
+
- The metric exhibits an irregular behavior with high variance, which appears as spikes or dips in the data.
84
+
85
+
Consider making the model less sensitive by choosing a higher sensitivity or selecting a larger **Lookback period**. You can also use the **Ignore data before** option to exclude a recent irregularity from the historical data used to build the model.
71
86
## Configuring dynamic thresholds
72
87
73
88
We recommend configuring alert rules with dynamic thresholds on these metrics:
@@ -86,26 +101,6 @@ When configuring alert rules in the [Azure portal](https://portal.azure.com), fo
86
101
87
102
> [!NOTE]
88
103
> Metric alert rules created through the portal are created in the same resource group as the target resource.
89
-
90
-
## Dynamic threshold sensitivity
91
-
92
-
- If an alert rule that uses dynamic thresholds is too noisy or fires too much, you may need to reduce the sensitivity of your dynamic thresholds alert rule. Use one of the following options:
93
-
-**Threshold sensitivity:** Set the sensitivity to **Low** to be more tolerant for deviations.
94
-
-**Number of violations (under Advanced settings):** Configure the alert rule to trigger only if several deviations occur within a certain period of time. This setting makes the rule less susceptible to transient deviations.
95
-
96
-
- If an alert rule that uses dynamic thresholds doesn't fire or isn't sensitive enough, even though it's configured with high sensitivity. This scenario usually happens when the metric's distribution is highly irregular. Consider one of the following solutions to fix the issue:
97
-
- Move to monitoring a complementary metric that's suitable for your scenario, if applicable. For example, check for changes in success rate rather than failure rate.
98
-
- Try selecting a different value for **Aggregation granularity (Period)**.
99
-
- Check if there has been a drastic change in the metric behavior in the last 10 days, such as an outage. An abrupt change can affect the upper and lower thresholds calculated for the metric and make them broader. Wait a few days until the outage is no longer taken into the thresholds calculation. You can also edit the alert rule to use the **Ignore data before** option in the **Advanced settings**.
100
-
- If your data has weekly seasonality, but not enough history is available for the metric, the calculated thresholds can result in having broad upper and lower bounds. For example, the calculation can treat weekdays and weekends in the same way and build wide borders that don't always fit the data. This issue should resolve itself after enough metric history is available. Then, the correct seasonality is detected and the calculated thresholds update accordingly.
101
-
102
-
- When a metric value exhibits large fluctuations, dynamic thresholds may build a wide model around the metric values, which can result in a lower or higher boundary than expected. This scenario can happen when:
103
-
- The sensitivity is set to low.
104
-
- The metric exhibits an irregular behavior with high variance, which appears as spikes or dips in the data.
105
-
106
-
Consider making the model less sensitive by choosing a higher sensitivity or selecting a larger **Lookback period**. You can also use the **Ignore data before** option to exclude a recent irregularity from the historical data used to build the model.
107
-
108
-
109
104
## Metrics not supported by dynamic thresholds
110
105
111
106
Dynamic thresholds are supported for most metrics, but some metrics can't use dynamic thresholds.
0 commit comments