Commit 2368426

authored
Update how-to-monitor-datasets.md
1 parent 13d2b9a commit 2368426

1 file changed

+3
-38
lines changed

articles/machine-learning/v1/how-to-monitor-datasets.md

Lines changed: 3 additions & 38 deletions
@@ -53,44 +53,9 @@ To create and work with dataset monitors, you need:
 * Structured (tabular) data with a timestamp specified in the file path, file name, or column in the data.
 
 ## Prerequisites (Migrate to Model Monitor)
-When you migrate to Model Monitor, please check the prerequisites as following:
+When you migrate to Model Monitor, please check the prerequisites as mentioned in this article [Prerequisites of Azure Machine Learning model monitoring](../how-to-monitor-model-performance.md#prerequisites).
 
 
-# [Python SDK](#tab/python)
-
-[!INCLUDE [basic prereqs sdk](../includes/machine-learning-sdk-v2-prereqs.md)]
-
-# [Studio](#tab/azure-studio)
-
-Before following the steps in this article, make sure you have the following prerequisites:
-
-* An Azure subscription. If you don't have an Azure subscription, create a free account before you begin. Try the [free or paid version of Azure Machine Learning](https://azure.microsoft.com/free/).
-
-* An Azure Machine Learning workspace and a compute instance. If you don't have these resources, use the steps in the [Quickstart: Create workspace resources](../quickstart-create-resources.md) article to create them.
-
-# [Azure CLI](#tab/azure-cli)
-
-[!INCLUDE [basic prereqs cli](../includes/machine-learning-cli-prereqs.md)]
----
-
-* Azure role-based access controls (Azure RBAC) are used to grant access to operations in Azure Machine Learning. To perform the steps in this article, your user account must be assigned the __owner__ or __contributor__ role for the Azure Machine Learning workspace, or a custom role allowing `Microsoft.MachineLearningServices/workspaces/onlineEndpoints/*`. For more information, see [Manage access to an Azure Machine Learning workspace](../how-to-assign-roles.md).
-
-* For monitoring a model that is deployed to an Azure Machine Learning online endpoint (managed online endpoint or Kubernetes online endpoint), be sure to:
-
-* Have a model already deployed to an Azure Machine Learning online endpoint. Both managed online endpoint and Kubernetes online endpoint are supported. If you don't have a model deployed to an Azure Machine Learning online endpoint, see [Deploy and score a machine learning model by using an online endpoint](../how-to-deploy-online-endpoints.md).
-
-* Enable data collection for your model deployment. You can enable data collection during the deployment step for Azure Machine Learning online endpoints. For more information, see [Collect production data from models deployed to a real-time endpoint](../how-to-collect-production-data.md).
-
-* For monitoring a model that is deployed to an Azure Machine Learning batch endpoint or deployed outside of Azure Machine Learning, be sure to:
-
-* Have a means to collect production data and register it as an Azure Machine Learning data asset.
-* Update the registered data asset continuously for model monitoring.
-* (Recommended) Register the model in an Azure Machine Learning workspace, for lineage tracking.
-
-> [!IMPORTANT]
->
-> Model monitoring jobs are scheduled to run on serverless Spark compute pools with support for the following VM instance types: `Standard_E4s_v3`, `Standard_E8s_v3`, `Standard_E16s_v3`, `Standard_E32s_v3`, and `Standard_E64s_v3`. You can select the VM instance type with the `create_monitor.compute.instance_type` property in your YAML configuration or from the dropdown in the Azure Machine Learning studio.
-
 ## What is data drift?
 
 Model accuracy degrades over time, largely because of data drift. For machine learning models, data drift is the change in model input data that leads to model performance degradation. Monitoring data drift helps detect these model performance issues.
@@ -233,7 +198,7 @@ The **backfill** function runs a backfill job, for a specified start and end dat
 > Azure Machine Learning model monitoring doesn't support manual **backfill** function, if you want to redo the model monitor for a specif time range, you can create another model monitor for that specific time range.
 
 # [Python SDK](#tab/python)
-
+<a name="sdk-monitor"></a>
 
 [!INCLUDE [sdk v1](../includes/machine-learning-sdk-v1.md)]
 
@@ -290,7 +255,7 @@ monitor = monitor.enable_schedule()
 
 
 # [Studio](#tab/azure-studio)
-
+<a name="studio-monitor"></a>
 
 1. Navigate to the [studio's homepage](https://ml.azure.com).
 1. Select the **Data** tab.
