You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/machine-learning/how-to-create-image-labeling-projects.md
+24-25Lines changed: 24 additions & 25 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -8,7 +8,7 @@ ms.reviewer: vkann
8
8
ms.service: azure-machine-learning
9
9
ms.subservice: mldata
10
10
ms.topic: how-to
11
-
ms.date: 02/01/2024
11
+
ms.date: 09/24/2024
12
12
ms.custom: data4ml
13
13
monikerRange: 'azureml-api-1 || azureml-api-2'
14
14
# Customer intent: As a project manager, I want to set up a project to label images in the project. I want to enable machine learning-assisted labeling to help with the task.
@@ -28,7 +28,7 @@ You can also use the data labeling tool in Azure Machine Learning to [create a t
28
28
29
29
Azure Machine Learning data labeling is a tool you can use to create, manage, and monitor data labeling projects. Use it to:
30
30
31
-
- Coordinate data, labels, and team members to efficiently manage labeling tasks.
31
+
- Coordinate data, labels, and team members to efficiently manage the labeling tasks.
32
32
- Track progress and maintain the queue of incomplete labeling tasks.
33
33
- Start and stop the project, and control the labeling progress.
34
34
- Review and export the labeled data as an Azure Machine Learning dataset.
@@ -38,20 +38,20 @@ Azure Machine Learning data labeling is a tool you can use to create, manage, an
38
38
39
39
Image data can be any file that has one of these file extensions:
40
40
41
-
-*.jpg*
42
-
-*.jpeg*
43
-
-*.png*
44
-
-*.jpe*
45
-
-*.jfif*
46
-
-*.bmp*
47
-
-*.tif*
48
-
-*.tiff*
49
-
-*.dcm*
50
-
-*.dicom*
41
+
-`.jpg`
42
+
-`.jpeg`
43
+
-`.png`
44
+
-`.jpe`
45
+
-`.jfif`
46
+
-`.bmp`
47
+
-`.tif`
48
+
-`.tiff`
49
+
-`.dcm`
50
+
-`.dicom`
51
51
52
52
Each file is an item to be labeled.
53
53
54
-
You can also use an MLTable data asset as input to an image labeling project, as long as the images in the table are one of the above formats. For more information, see [How to use MLTable data assets](./how-to-mltable.md).
54
+
You can also use an `MLTable` data asset as input to an image labeling project, as long as the images in the table are one of the above formats. For more information, see [How to use `MLTable` data assets](./how-to-mltable.md).
55
55
56
56
## Prerequisites
57
57
@@ -76,10 +76,10 @@ You use these items to set up image labeling in Azure Machine Learning:
76
76
* To apply only a *single label* to an image from a set of labels, select **Image Classification Multi-class**.
77
77
* To apply *one or more* labels to an image from a set of labels, select **Image Classification Multi-label**. For example, a photo of a dog might be labeled with both *dog* and *daytime*.
78
78
* To assign a label to each object within an image and add bounding boxes, select **Object Identification (Bounding Box)**.
79
-
* To assign a label to each object within an image and draw a polygon around each object, select **Instance Segmentation (Polygon)**.
79
+
* To assign a label to each object within an image and draw a polygon around each object, select **Polygon (Instance Segmentation)**.
80
80
* To draw masks on an image and assign a label class at the pixel level, select **Semantic Segmentation (Preview)**.
81
81
82
-
:::image type="content" source="media/how-to-create-labeling-projects/labeling-creation-wizard.png" alt-text="Screenshot that shows creating a labeling project to manage labeling.":::
82
+
:::image type="content" source="media/how-to-create-labeling-projects/labeling-creation-wizard.png" alt-text="Screenshot that shows creating a labeling project to manage the labeling task.":::
83
83
84
84
1. Select **Next** to continue.
85
85
@@ -98,7 +98,7 @@ You can also select **Create a dataset** to use an existing Azure datastore or t
98
98
99
99
### Data column mapping (preview)
100
100
101
-
If you select an MLTable data asset, an additional**Data Column Mapping** step appears for you to specify the column that contains the image URLs.
101
+
If you select a MLTable data asset, another**Data Column Mapping** step appears for you to specify the column that contains the image URLs.
@@ -169,7 +169,7 @@ For bounding boxes, important questions include:
169
169
* How should labelers handle an object that isn't the object class of interest but has visual similarities to a relevant object type?
170
170
171
171
> [!NOTE]
172
-
> Labelers can select the first nine labels by using number keys 1 through 9.
172
+
> Labelers can select the first nine labels by using number keys 1 through 9. You might want to include this information in your instructions.
173
173
174
174
## Quality control (preview)
175
175
@@ -180,7 +180,7 @@ For bounding boxes, important questions include:
180
180
181
181
## Use ML-assisted data labeling
182
182
183
-
To accelerate labeling tasks, on the **ML assisted labeling** page, you can trigger automatic machine learning models. Medical images (files that have a *.dcm* extension) aren't included in assisted labeling. If the project type is **Semantic Segmentation (Preview)**, ML-assisted labeling isn't available.
183
+
To accelerate labeling tasks, on the **ML assisted labeling** page, you can trigger automatic machine learning models. Medical images (files that have a `.dcm` extension) aren't included in assisted labeling. If the project type is **Semantic Segmentation (Preview)**, ML-assisted labeling isn't available.
184
184
185
185
At the start of your labeling project, the items are shuffled into a random order to reduce potential bias. However, the trained model reflects any biases that are present in the dataset. For example, if 80 percent of your items are of a single class, then approximately 80 percent of the data used to train the model lands in that class.
186
186
@@ -189,9 +189,9 @@ To enable assisted labeling, select **Enable ML assisted labeling** and specify
189
189
ML-assisted labeling consists of two phases:
190
190
191
191
* Clustering
192
-
*Pre-labeling
192
+
*Prelabeling
193
193
194
-
The labeled data item count that's required to start assisted labeling isn't a fixed number. This number can vary significantly from one labeling project to another. For some projects, it's sometimes possible to see pre-label or cluster tasks after 300 items have been manually labeled. ML-assisted labeling uses a technique called *transfer learning*. Transfer learning uses a pre-trained model to jump-start the training process. If the classes of your dataset resemble the classes in the pre-trained model, pre-labels might become available after only a few hundred manually labeled items. If your dataset significantly differs from the data that's used to pre-train the model, the process might take more time.
194
+
The labeled data item count that's needed to start assisted labeling isn't a fixed number. This number can vary significantly from one labeling project to another. For some projects, it's sometimes possible to see prelabel or cluster tasks after 300 items are manually labeled. ML-assisted labeling uses a technique called *transfer learning*. Transfer learning uses a pretrained model to jump-start the training process. If the classes of your dataset resemble the classes in the pretrained model, prelabels might become available after only a few hundred manually labeled items. If your dataset significantly differs from the data that's used to pretrain the model, the process might take more time.
195
195
196
196
When you use consensus labeling, the consensus label is used for training.
197
197
@@ -208,11 +208,11 @@ After a machine learning model is trained on your manually labeled data, the mod
208
208
209
209
The clustering phase doesn't appear for object detection models or text classification.
210
210
211
-
### Pre-labeling
211
+
### Prelabeling
212
212
213
-
After you submit enough labels for training, either a classification model predicts tags or an object detection model predicts bounding boxes. The labeler now sees pages that contain predicted labels already present on each item. For object detection, predicted boxes are also shown. The task involves reviewing these predictions and correcting any incorrectly labeled images before page submission.
213
+
After you submit enough labels for training, either a classification model predicts tags, or an object detection model predicts bounding boxes. The labeler now sees pages that contain predicted labels already present on each item. For object detection, predicted boxes are also shown. The task involves reviewing these predictions and correcting any incorrectly labeled images before page submission.
214
214
215
-
After a machine learning model is trained on your manually labeled data, the model is evaluated on a test set of manually labeled items. The evaluation helps determine the model's accuracy at different confidence thresholds. The evaluation process sets a confidence threshold beyond which the model is accurate enough to show pre-labels. The model is then evaluated against unlabeled data. Items with predictions that are more confident than the threshold are used for pre-labeling.
215
+
After a machine learning model is trained on your manually labeled data, the model is evaluated on a test set of manually labeled items. The evaluation helps determine the model's accuracy at different confidence thresholds. The evaluation process sets a confidence threshold beyond which the model is accurate enough to show prelabels. The model is then evaluated against unlabeled data. Items with predictions that are more confident than the threshold are used for prelabeling.
216
216
217
217
## Initialize the image labeling project
218
218
@@ -224,8 +224,7 @@ After a machine learning model is trained on your manually labeled data, the mod
Copy file name to clipboardExpand all lines: articles/machine-learning/includes/machine-learning-data-labeling-initialize.md
+2-2Lines changed: 2 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -2,11 +2,11 @@
2
2
author: sgilley
3
3
ms.service: azure-machine-learning
4
4
ms.topic: include
5
-
ms.date: 10/21/2021
5
+
ms.date: 09/24/2024
6
6
ms.author: sdgilley
7
7
---
8
8
9
9
After the labeling project is initialized, some aspects of the project are immutable. You can't change the task type or dataset. You *can* modify labels and the URL for the task description. Carefully review the settings before you create the project. After you submit the project, you return to the **Data Labeling** overview page, which shows the project as **Initializing**.
10
10
11
11
> [!NOTE]
12
-
> This page might not automatically refresh. After a pause, manually refresh the page to see the project's status as **Created**.
12
+
> The overview page might not automatically refresh. After a pause, manually refresh the page to see the project's status as **Created**.
Copy file name to clipboardExpand all lines: articles/machine-learning/includes/machine-learning-data-labeling-refresh.md
+2-2Lines changed: 2 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -2,7 +2,7 @@
2
2
author: sgilley
3
3
ms.service: azure-machine-learning
4
4
ms.topic: include
5
-
ms.date: 12/08/2021
5
+
ms.date: 09/24/2024
6
6
ms.author: sdgilley
7
7
---
8
8
@@ -15,7 +15,7 @@ Select **Enable incremental refresh at regular intervals** when you want your pr
15
15
Clear the selection if you don't want new files in the datastore to automatically be added to your project.
16
16
17
17
> [!IMPORTANT]
18
-
> Don't create a new version for the dataset you want to update. If you do, the updates won't be seen because the data labeling project is pinned to the initial version. Instead, use [Azure Storage Explorer](https://azure.microsoft.com/features/storage-explorer/) to modify your data in the appropriate folder in Blob Storage.
18
+
> When incremental refresh is enabled, don't create a new version for the dataset you want to update. If you do, the updates won't be seen because the data labeling project is pinned to the initial version. Instead, use [Azure Storage Explorer](https://azure.microsoft.com/features/storage-explorer/) to modify your data in the appropriate folder in Blob Storage.
19
19
>
20
20
> Also, don't remove data. Removing data from the dataset your project uses causes an error in the project.
0 commit comments