You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In this article, you learn how to monitor [Azure Machine Learning online endpoints](concept-endpoints.md). Use Application Insights to view metrics and create alerts to stay up to date with your online endpoints.
18
+
Azure Machine Learning provides several ways to track and monitor metrics and logs regarding [Azure Machine Learning online endpoints](concept-endpoints.md). Integrated with Azure Monitor, you can view metrics in chart, compare between endpoints and deployments, pin to Azure portal dashboards, configure alerts, query from log tables and push the logs to supported targets. You can also use Application Insights to analyze events from user containers.
19
+
20
+
***Metrics**: Endpoint level metrics such as request latency, requests per minute, new connections per second, network bytes, etc. can be drilled down to deployment or status level. Deployment level metrics such as CPU/GPU utilization, memory or disk utilization can be drilled down to instance level. Azure Monitor allows tracking these Metrics in charts and setting up dashboards and alerts for further analysis.
21
+
22
+
***Logs**: You can send Metrics to the Log Analytics Workspace where you can query the logs using rich Kusto query syntax. You can also send Metrics to Storage Account and/or Event Hubs for further processing. In addition, you can use dedicated Log tables for online endpoint related events, traffic, and console (container) logs. Kusto query allows complex analysis joining multiple tables.
23
+
24
+
***Application insights**: Curated environments include the integration with Application Insights, and you can simply enable/disable it when you create an online deployment. Built-in metrics and logs are sent to Application insights, and you can use its built-in features such as Live metrics, Transaction search, Failures and Performance for further analysis.
19
25
20
26
In this article you learn how to:
21
27
22
28
> [!div class="checklist"]
29
+
> * Choose the right method to view and track metrics and logs
23
30
> * View metrics for your online endpoint
24
31
> * Create a dashboard for your metrics
25
32
> * Create a metric alert
33
+
> * View logs for your online endpoint
34
+
> * Use application insights (legacy) to track metrics and logs
26
35
27
36
> [!IMPORTANT]
28
37
> Items marked (preview) in this article are currently in public preview.
@@ -36,7 +45,7 @@ In this article you learn how to:
36
45
37
46
## Metrics
38
47
39
-
You can view metrics pages for online endpoints or deployments in the Azure portal. An easy way to access these metrics pages is through links available in the Azure Machine Learning studio user interface—specifically in the **Details** tab of an endpoint's page. Following these links will take you to the exact metrics page in the Azure portal for the endpoint or deployment. Alternatively, you can also go into the Azure portal to search for the metrics page for the endpoint or deployment.
48
+
You can view metrics pages for online endpoints or deployments in the Azure portal. An easy way to access these metrics pages is through links available in the Azure Machine Learning studio user interface — specifically in the **Details** tab of an endpoint's page. Following these links will take you to the exact metrics page in the Azure portal for the endpoint or deployment. Alternatively, you can also go into the Azure portal to search for the metrics page for the endpoint or deployment.
40
49
41
50
To access the metrics pages through links available in the studio:
42
51
@@ -79,11 +88,14 @@ Split on the following dimensions:
79
88
- Status Code Class
80
89
- Model Status Code
81
90
91
+
For instance, using splits you can compare request latency and/or requests per minute between different deployments under an endpoint.
92
+
82
93
**Bandwidth throttling**
83
94
84
-
Bandwidth will be throttled if the limits are exceeded for _managed_ online endpoints (see managed online endpoints section in [Manage and increase quotas for resources with Azure Machine Learning](how-to-manage-quotas.md#azure-machine-learning-managed-online-endpoints)). To determine if requests are throttled:
95
+
Bandwidth will be throttled if the limits are exceeded for _managed_ online endpoints quota (see managed online endpoints section in [Manage and increase quotas for resources with Azure Machine Learning](how-to-manage-quotas.md#azure-machine-learning-managed-online-endpoints)). To determine if requests are throttled:
85
96
- Monitor the "Network bytes" metric
86
97
- The response trailers will have the fields: `ms-azureml-bandwidth-request-delay-ms` and `ms-azureml-bandwidth-response-delay-ms`. The values of the fields are the delays, in milliseconds, of the bandwidth throttling.
98
+
For more information, see [Bandwidth limit issues](how-to-troubleshoot-online-endpoints.md#bandwidth-limit-issues).
87
99
88
100
#### Metrics at deployment scope
89
101
@@ -98,25 +110,8 @@ Split on the following dimension:
98
110
99
111
- InstanceId
100
112
101
-
### Create a dashboard
102
-
103
-
You can create custom dashboards to visualize data from multiple sources in the Azure portal, including the metrics for your online endpoint. For more information, see [Create custom KPI dashboards using Application Insights](../azure-monitor/app/tutorial-app-dashboards.md#add-custom-metric-chart).
104
-
105
-
### Create an alert
106
-
107
-
You can also create custom alerts to notify you of important status updates to your online endpoint:
108
-
109
-
1. At the top right of the metrics page, select **New alert rule**.
113
+
For instance, you can compare CPU and/or memory utilization between difference instances for an online deployment.
110
114
111
-
:::image type="content" source="./media/how-to-monitor-online-endpoints/online-endpoints-new-alert-rule.png" alt-text="Monitoring online endpoints: screenshot showing 'New alert rule' button surrounded by a red box":::
112
-
113
-
1. Select a condition name to specify when your alert should be triggered.
114
-
115
-
:::image type="content" source="./media/how-to-monitor-online-endpoints/online-endpoints-configure-signal-logic.png" alt-text="Monitoring online endpoints: screenshot showing 'Configure signal logic' button surrounded by a red box":::
116
-
117
-
1. Select **Add action groups** > **Create action groups** to specify what should happen when your alert is triggered.
118
-
119
-
1. Choose **Create alert rule** to finish creating your alert.
120
115
121
116
## Logs
122
117
@@ -245,6 +240,37 @@ The following tables provide details on the data stored in each log:
245
240
| Message | The content of the event.
246
241
247
242
243
+
## Using Application Insights
244
+
245
+
When your online deployment uses curated docker image, it includes the integration with Application Insights. To activate log collection using Application Insights, you can simply enable it when you create an online deployment. Built-in metrics and logs are sent to Application insights, and you can use its built-in features such as Live metrics, Transaction search, Failures and Performance.
246
+
247
+
248
+
## Create dashboards and alerts
249
+
250
+
Azure Monitor allows creating dashboards and alerts based on metrics.
251
+
252
+
### Create dashboards and visualize queries
253
+
254
+
You can create custom dashboards and visualize metrics from multiple sources in the Azure portal, including the metrics for your online endpoint. For more information on creating dashboards and visualizing queries, see [Dashboards using log data](../azure-monitor/visualize/tutorial-logs-dashboards.md) and [Dashboards using application data](../azure-monitor/app/tutorial-app-dashboards.md).
255
+
256
+
### Create alerts
257
+
258
+
You can also create custom alerts to notify you of important status updates to your online endpoint:
259
+
260
+
1. At the top right of the metrics page, select **New alert rule**.
261
+
262
+
:::image type="content" source="./media/how-to-monitor-online-endpoints/online-endpoints-new-alert-rule.png" alt-text="Sscreenshot showing 'New alert rule' button surrounded by a red box.":::
263
+
264
+
1. Select a condition name to specify when your alert should be triggered.
265
+
266
+
:::image type="content" source="./media/how-to-monitor-online-endpoints/online-endpoints-configure-signal-logic.png" alt-text="Screenshot showing 'Configure signal logic' button surrounded by a red box.":::
267
+
268
+
1. Select **Add action groups** > **Create action groups** to specify what should happen when your alert is triggered.
269
+
270
+
1. Choose **Create alert rule** to finish creating your alert.
271
+
272
+
For more information, see [Create Azure Monitor alert rules](../azure-monitor/alerts/alerts-create-new-alert-rule.md).
0 commit comments