|
1 |
| -Azure Monitor can proactively notify you when specific conditions are found in your monitoring data. You can create an alert based on any metric or log data source in the Azure Monitor data platform. There are different types of alerts: |
| 1 | +Azure Monitor can proactively notify you when it finds specific conditions in your monitoring data. You can create an alert based on any metric or log data source in the Azure Monitor data platform. |
2 | 2 |
|
3 |
| -- Metric alerts evaluate resource metrics at regular intervals. Metrics can be platform metrics, custom metrics, logs from Azure Monitor converted to metrics, or Application Insights metrics. Metric alerts can also apply multiple conditions and dynamic thresholds. |
4 |
| -- Log alerts allow users to use a Log Analytics query to evaluate resource logs at a predefined frequency. |
5 |
| -- Activity log alerts trigger when a new activity log event occurs that matches defined conditions. Resource Health alerts and Service Health alerts are activity log alerts that report on your service and resource health. |
| 3 | +There are various types of alerts: |
6 | 4 |
|
7 |
| -When monitoring an Azure Machine Learning workspace, you might want to get an alert when a Model deploy fails, when quota utilization exceeds a threshold, or when there are one or more unusable nodes. To create an alert: |
| 5 | +- *Metric alerts* evaluate resource metrics at regular intervals. Metrics can be platform metrics, custom metrics, logs from Azure Monitor converted to metrics, or Application Insights metrics. Metric alerts can also apply multiple conditions and dynamic thresholds. |
| 6 | +- *Log alerts* enable users to use a Log Analytics query to evaluate resource logs at a predefined frequency. |
| 7 | +- *Activity log alerts* trigger when a new activity log event matches defined conditions. For example, activity log alerts can report on your service health and resource health. |
8 | 8 |
|
9 |
| -1. On the Azure portal, open the Azure Machine Learning resource. |
10 |
| -1. On the left-hand side menu, expand Monitoring and select Alerts. |
11 |
| -1. Select the + Create drop down menu and select Alert rule. |
12 |
| -1. Notice that the wizard starts on the second tab, Condition. This is because the Scope has already been set to Azure Machine Learning resource. |
13 |
| -1. On the Signal name, select See all signals. |
14 |
| -1. Notice that you can look for a range of signals including Custom log search, Metrics, and Activity log. |
15 |
| -1. Under Metrics, select Failed Runs. |
16 |
| -1. On the Alert logic, you can provide thresholds for when the condition is met to trigger the alert. On the Threshold enter 5. This means more than 5 failed deployments trigger this alert. You can change the variables to meet your needs. Select Next. |
17 |
| -1. In the Actions tab, make sure Use quick actions is selected and provide the details in the right-hand side pane. Select Save and select Next. |
18 |
| -1. On the Details tab, confirm the subscription and Resource group to use. Select the appropriate Severity level. In Alert rule name, provide the name of the alert and a description next. Select Review + Create. |
| 9 | +When you're monitoring an Azure Machine Learning workspace, you might want to get an alert when a model deployment fails, when quota utilization exceeds a threshold, or when nodes are unusable. |
19 | 10 |
|
20 |
| -The Alerts dashboard shows information if the alert has been triggered: |
| 11 | +To create an alert: |
21 | 12 |
|
22 |
| - |
| 13 | +1. In the Azure portal, open the Azure Machine Learning resource. |
23 | 14 |
|
24 |
| -If you configure an Alert to send an e-mail, you should receive a notification: |
| 15 | +1. On the left menu, expand **Monitoring** and select **Alerts**. |
25 | 16 |
|
26 |
| - |
| 17 | +1. On the **Create** dropdown menu, select **Alert rule**. |
| 18 | + |
| 19 | +1. Notice that the wizard starts on the second tab, **Condition**. The reason that the scope is already set to an Azure Machine Learning resource. |
| 20 | + |
| 21 | +1. For **Signal name**, select **See all signals**. |
| 22 | + |
| 23 | + Notice that you can look for a range of signals, including **Custom log search**, **Metrics**, and **Activity log**. |
| 24 | + |
| 25 | +1. Under **Metrics**, select **Failed Runs**. |
| 26 | + |
| 27 | +1. For **Alert logic**, you can provide thresholds for when the condition is met to trigger the alert. For **Threshold**, enter **5**. This value means that more than five failed deployments trigger this alert. You can change the variables to meet your needs. When you finish, select **Next**. |
| 28 | + |
| 29 | +1. On the **Actions** tab, make sure that **Use quick actions** is selected and provide the details on the right pane. Then select **Save** > **Next**. |
| 30 | + |
| 31 | +1. On the **Details** tab, confirm the subscription and resource group to use. Select the appropriate severity level. For **Alert rule name**, provide the name of the alert and a description. Then select **Review + Create**. |
| 32 | + |
| 33 | +The **Alerts** dashboard shows information if the alert is triggered. |
| 34 | + |
| 35 | + |
| 36 | + |
| 37 | +If you configure an alert to send an email, you should receive a notification. |
| 38 | + |
| 39 | + |
0 commit comments