You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/machine-learning/how-to-create-labeling-projects.md
+11-8Lines changed: 11 additions & 8 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,7 +1,7 @@
1
1
---
2
2
title: Create a data labeling project
3
3
titleSuffix: Azure Machine Learning
4
-
description: Learn how to create and run labeling projects to tag data for machine learning. The tools include assisted ML, or human in the loop labeling to aid with the task.
4
+
description: Learn how to create and run labeling projects to tag data for machine learning. The tools include ml assisted labeling, or human in the loop labeling to aid with the task.
5
5
author: sdgilley
6
6
ms.author: sgilley
7
7
ms.service: machine-learning
@@ -16,11 +16,11 @@ ms.date: 03/01/2020
16
16
17
17
Labeling voluminous data in machine learning projects is often a headache. Projects that have a computer-vision component, such as image classification or object detection, generally require labels for thousands of images.
18
18
19
-
[Azure Machine Learning](https://ml.azure.com/) gives you a central place to create, manage, and monitor labeling projects. Use it to coordinate data, labels, and team members to efficiently manage labeling tasks. Machine Learning supports image classification, either multi-label or multi-class, and object identification together with bounded boxes.
19
+
[Azure Machine Learning](https://ml.azure.com/) gives you a central place to create, manage, and monitor labeling projects. Use it to coordinate data, labels, and team members to efficiently manage labeling tasks. Machine Learning supports image classification, either multi-label or multi-class, and object identification with bounded boxes.
20
20
21
21
Machine Learning tracks progress and maintains the queue of incomplete labeling tasks. Labelers don't need an Azure account to participate. After they are authenticated with your Microsoft account or [Azure Active Directory](https://docs.microsoft.com/azure/active-directory/active-directory-whatis), they can do as much labeling as their time allows.
22
22
23
-
You start and stop the project, add and remove people and teams, and monitor progress. You can export labeled data in COCO format or as an Azure Machine Learning dataset.
23
+
You start and stop the project, add and remove labelers and teams, and monitor the labeling progress. You can export labeled data in COCO format or as an Azure Machine Learning dataset.
24
24
25
25
> [!Important]
26
26
> Only image classification and object identification labeling projects are currently supported. Additionally, the data images must be available in an Azure blob datastore. (If you do not have an existing datastore, you may upload images during project creation.)
@@ -121,6 +121,8 @@ For bounding boxes, important questions include:
121
121
122
122
* How is the bounding box defined for this task? Should it be entirely on the interior of the object, or should it be on the exterior? Should it be cropped as closely as possible, or is some clearance acceptable?
123
123
* What level of care and consistency do you expect the labelers to apply in defining bounding boxes?
124
+
* How to label the object that is partially shown in the image?
125
+
* How to label the object that partially covered by other object?
124
126
125
127
>[!NOTE]
126
128
> Be sure to note that the labelers will be able to select the first 9 labels by using number keys 1-9.
@@ -131,17 +133,18 @@ For bounding boxes, important questions include:
131
133
132
134
The **ML assisted labeling** page lets you trigger automatic machine learning models to accelerate the labeling task. This feature is available for image classification (multi-class or multi-label) tasks.
133
135
134
-
Select *Enable ML assisted labeling* and specify a GPU to enable the two phases of assisted labeling:
136
+
Select *Enable ML assisted labeling* and specify a GPU to enable assisted labeling, which consists of two phases:
135
137
136
-
***Clustering** - after a certain number of labels are submitted, the machine learning model starts to group together similar images. These similar images are presented to the labelers on the same screen to speed up manual tagging. Clustering is most useful when the labeler is viewing multiple images.
138
+
***Clustering** - after a certain number of labels are submitted, the machine learning model starts to group together similar images. These similar images are presented to the labelers on the same screen to speed up manual tagging. Clustering is especially useful when the labeler is viewing a grid of 4, 6, or 9 images.
137
139
138
140
***Prelabeling** - after more image labels are submitted, a classification model is used to predict image tags. The labeler now sees pages that contain predicted labels already present on each image. The task is then to review these labels and correct any mis-labeled images before submitting the page.
139
141
140
-
The exact number of labeled images necessary to start assisted labeling is not a fixed number. The first clustering model is built once there are 350 labeled images, and the first classification model will build with 900 labeled images. But the models aren't used until an accuracy threshold is also met.
142
+
The exact number of labeled images necessary to start assisted labeling is not a fixed number. The actual value depends on the number of label classes defined in your project. Labeling service will start to train a model when there are enough labels and use the model to produce either a clustered or prelabeled task.
141
143
142
144
Since the final labels still rely on input from the labeler, this technology is sometimes called *human in the loop* labeling.
143
145
144
-
This feature is available only in Enterprise edition workspaces.
146
+
> [!NOTE]
147
+
> ML assisted labeling is available **only** in Enterprise edition workspaces.
145
148
146
149
## Initialize the labeling project
147
150
@@ -169,7 +172,7 @@ To pause or restart the project, select the **Pause**/**Start** button. You can
169
172
170
173
You can label data directly from the **Project details** page by selecting **Label data**.
171
174
172
-
## Add labels to a project
175
+
## Add new label class to a project
173
176
174
177
During the labeling process, you may find that additional labels are needed to classify your images. For example, you may want to add an "Unknown" or "Other" label to indicate confusing images.
0 commit comments