Skip to content

Commit f68e99f

Browse files
Merge pull request #229860 from dem108/patch-18
Clarify monitoring options
2 parents c604590 + c812bc6 commit f68e99f

File tree

1 file changed

+48
-22
lines changed

1 file changed

+48
-22
lines changed

articles/machine-learning/how-to-monitor-online-endpoints.md

Lines changed: 48 additions & 22 deletions
Original file line numberDiff line numberDiff line change
@@ -8,21 +8,30 @@ ms.reviewer: mopeakande
88
author: Bozhong68
99
ms.author: bozhlin
1010
ms.subservice: mlops
11-
ms.date: 08/29/2022
11+
ms.date: 03/07/2023
1212
ms.topic: conceptual
1313
ms.custom: how-to, devplatv2, event-tier1-build-2022
1414
---
1515

1616
# Monitor online endpoints
1717

18-
In this article, you learn how to monitor [Azure Machine Learning online endpoints](concept-endpoints.md). Use Application Insights to view metrics and create alerts to stay up to date with your online endpoints.
18+
Azure Machine Learning provides several ways to track and monitor metrics and logs regarding [Azure Machine Learning online endpoints](concept-endpoints.md). Integrated with Azure Monitor, you can view metrics in chart, compare between endpoints and deployments, pin to Azure portal dashboards, configure alerts, query from log tables and push the logs to supported targets. You can also use Application Insights to analyze events from user containers.
19+
20+
* **Metrics**: Endpoint level metrics such as request latency, requests per minute, new connections per second, network bytes, etc. can be drilled down to deployment or status level. Deployment level metrics such as CPU/GPU utilization, memory or disk utilization can be drilled down to instance level. Azure Monitor allows tracking these Metrics in charts and setting up dashboards and alerts for further analysis.
21+
22+
* **Logs**: You can send Metrics to the Log Analytics Workspace where you can query the logs using rich Kusto query syntax. You can also send Metrics to Storage Account and/or Event Hubs for further processing. In addition, you can use dedicated Log tables for online endpoint related events, traffic, and console (container) logs. Kusto query allows complex analysis joining multiple tables.
23+
24+
* **Application insights**: Curated environments include the integration with Application Insights, and you can simply enable/disable it when you create an online deployment. Built-in metrics and logs are sent to Application insights, and you can use its built-in features such as Live metrics, Transaction search, Failures and Performance for further analysis.
1925

2026
In this article you learn how to:
2127

2228
> [!div class="checklist"]
29+
> * Choose the right method to view and track metrics and logs
2330
> * View metrics for your online endpoint
2431
> * Create a dashboard for your metrics
2532
> * Create a metric alert
33+
> * View logs for your online endpoint
34+
> * Use application insights (legacy) to track metrics and logs
2635
2736
> [!IMPORTANT]
2837
> Items marked (preview) in this article are currently in public preview.
@@ -36,7 +45,7 @@ In this article you learn how to:
3645

3746
## Metrics
3847

39-
You can view metrics pages for online endpoints or deployments in the Azure portal. An easy way to access these metrics pages is through links available in the Azure Machine Learning studio user interfacespecifically in the **Details** tab of an endpoint's page. Following these links will take you to the exact metrics page in the Azure portal for the endpoint or deployment. Alternatively, you can also go into the Azure portal to search for the metrics page for the endpoint or deployment.
48+
You can view metrics pages for online endpoints or deployments in the Azure portal. An easy way to access these metrics pages is through links available in the Azure Machine Learning studio user interfacespecifically in the **Details** tab of an endpoint's page. Following these links will take you to the exact metrics page in the Azure portal for the endpoint or deployment. Alternatively, you can also go into the Azure portal to search for the metrics page for the endpoint or deployment.
4049

4150
To access the metrics pages through links available in the studio:
4251

@@ -79,11 +88,14 @@ Split on the following dimensions:
7988
- Status Code Class
8089
- Model Status Code
8190

91+
For instance, using splits you can compare request latency and/or requests per minute between different deployments under an endpoint.
92+
8293
**Bandwidth throttling**
8394

84-
Bandwidth will be throttled if the limits are exceeded for _managed_ online endpoints (see managed online endpoints section in [Manage and increase quotas for resources with Azure Machine Learning](how-to-manage-quotas.md#azure-machine-learning-managed-online-endpoints)). To determine if requests are throttled:
95+
Bandwidth will be throttled if the limits are exceeded for _managed_ online endpoints quota (see managed online endpoints section in [Manage and increase quotas for resources with Azure Machine Learning](how-to-manage-quotas.md#azure-machine-learning-managed-online-endpoints)). To determine if requests are throttled:
8596
- Monitor the "Network bytes" metric
8697
- The response trailers will have the fields: `ms-azureml-bandwidth-request-delay-ms` and `ms-azureml-bandwidth-response-delay-ms`. The values of the fields are the delays, in milliseconds, of the bandwidth throttling.
98+
For more information, see [Bandwidth limit issues](how-to-troubleshoot-online-endpoints.md#bandwidth-limit-issues).
8799

88100
#### Metrics at deployment scope
89101

@@ -98,25 +110,8 @@ Split on the following dimension:
98110

99111
- InstanceId
100112

101-
### Create a dashboard
102-
103-
You can create custom dashboards to visualize data from multiple sources in the Azure portal, including the metrics for your online endpoint. For more information, see [Create custom KPI dashboards using Application Insights](../azure-monitor/app/tutorial-app-dashboards.md#add-custom-metric-chart).
104-
105-
### Create an alert
106-
107-
You can also create custom alerts to notify you of important status updates to your online endpoint:
108-
109-
1. At the top right of the metrics page, select **New alert rule**.
113+
For instance, you can compare CPU and/or memory utilization between difference instances for an online deployment.
110114

111-
:::image type="content" source="./media/how-to-monitor-online-endpoints/online-endpoints-new-alert-rule.png" alt-text="Monitoring online endpoints: screenshot showing 'New alert rule' button surrounded by a red box":::
112-
113-
1. Select a condition name to specify when your alert should be triggered.
114-
115-
:::image type="content" source="./media/how-to-monitor-online-endpoints/online-endpoints-configure-signal-logic.png" alt-text="Monitoring online endpoints: screenshot showing 'Configure signal logic' button surrounded by a red box":::
116-
117-
1. Select **Add action groups** > **Create action groups** to specify what should happen when your alert is triggered.
118-
119-
1. Choose **Create alert rule** to finish creating your alert.
120115

121116
## Logs
122117

@@ -245,6 +240,37 @@ The following tables provide details on the data stored in each log:
245240
| Message | The content of the event.
246241

247242

243+
## Using Application Insights
244+
245+
When your online deployment uses curated docker image, it includes the integration with Application Insights. To activate log collection using Application Insights, you can simply enable it when you create an online deployment. Built-in metrics and logs are sent to Application insights, and you can use its built-in features such as Live metrics, Transaction search, Failures and Performance.
246+
247+
248+
## Create dashboards and alerts
249+
250+
Azure Monitor allows creating dashboards and alerts based on metrics.
251+
252+
### Create dashboards and visualize queries
253+
254+
You can create custom dashboards and visualize metrics from multiple sources in the Azure portal, including the metrics for your online endpoint. For more information on creating dashboards and visualizing queries, see [Dashboards using log data](../azure-monitor/visualize/tutorial-logs-dashboards.md) and [Dashboards using application data](../azure-monitor/app/tutorial-app-dashboards.md).
255+
256+
### Create alerts
257+
258+
You can also create custom alerts to notify you of important status updates to your online endpoint:
259+
260+
1. At the top right of the metrics page, select **New alert rule**.
261+
262+
:::image type="content" source="./media/how-to-monitor-online-endpoints/online-endpoints-new-alert-rule.png" alt-text="Sscreenshot showing 'New alert rule' button surrounded by a red box.":::
263+
264+
1. Select a condition name to specify when your alert should be triggered.
265+
266+
:::image type="content" source="./media/how-to-monitor-online-endpoints/online-endpoints-configure-signal-logic.png" alt-text="Screenshot showing 'Configure signal logic' button surrounded by a red box.":::
267+
268+
1. Select **Add action groups** > **Create action groups** to specify what should happen when your alert is triggered.
269+
270+
1. Choose **Create alert rule** to finish creating your alert.
271+
272+
For more information, see [Create Azure Monitor alert rules](../azure-monitor/alerts/alerts-create-new-alert-rule.md).
273+
248274

249275
## Next steps
250276

0 commit comments

Comments
 (0)