Skip to content

Commit 9e67886

Browse files
committed
more changes from labeling team
1 parent 92e6454 commit 9e67886

File tree

1 file changed

+11
-8
lines changed

1 file changed

+11
-8
lines changed

articles/machine-learning/how-to-create-labeling-projects.md

Lines changed: 11 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
---
22
title: Create a data labeling project
33
titleSuffix: Azure Machine Learning
4-
description: Learn how to create and run labeling projects to tag data for machine learning. The tools include assisted ML, or human in the loop labeling to aid with the task.
4+
description: Learn how to create and run labeling projects to tag data for machine learning. The tools include ml assisted labeling, or human in the loop labeling to aid with the task.
55
author: sdgilley
66
ms.author: sgilley
77
ms.service: machine-learning
@@ -16,11 +16,11 @@ ms.date: 03/01/2020
1616

1717
Labeling voluminous data in machine learning projects is often a headache. Projects that have a computer-vision component, such as image classification or object detection, generally require labels for thousands of images.
1818

19-
[Azure Machine Learning](https://ml.azure.com/) gives you a central place to create, manage, and monitor labeling projects. Use it to coordinate data, labels, and team members to efficiently manage labeling tasks. Machine Learning supports image classification, either multi-label or multi-class, and object identification together with bounded boxes.
19+
[Azure Machine Learning](https://ml.azure.com/) gives you a central place to create, manage, and monitor labeling projects. Use it to coordinate data, labels, and team members to efficiently manage labeling tasks. Machine Learning supports image classification, either multi-label or multi-class, and object identification with bounded boxes.
2020

2121
Machine Learning tracks progress and maintains the queue of incomplete labeling tasks. Labelers don't need an Azure account to participate. After they are authenticated with your Microsoft account or [Azure Active Directory](https://docs.microsoft.com/azure/active-directory/active-directory-whatis), they can do as much labeling as their time allows.
2222

23-
You start and stop the project, add and remove people and teams, and monitor progress. You can export labeled data in COCO format or as an Azure Machine Learning dataset.
23+
You start and stop the project, add and remove labelers and teams, and monitor the labeling progress. You can export labeled data in COCO format or as an Azure Machine Learning dataset.
2424

2525
> [!Important]
2626
> Only image classification and object identification labeling projects are currently supported. Additionally, the data images must be available in an Azure blob datastore. (If you do not have an existing datastore, you may upload images during project creation.)
@@ -121,6 +121,8 @@ For bounding boxes, important questions include:
121121

122122
* How is the bounding box defined for this task? Should it be entirely on the interior of the object, or should it be on the exterior? Should it be cropped as closely as possible, or is some clearance acceptable?
123123
* What level of care and consistency do you expect the labelers to apply in defining bounding boxes?
124+
* How to label the object that is partially shown in the image?
125+
* How to label the object that partially covered by other object?
124126

125127
>[!NOTE]
126128
> Be sure to note that the labelers will be able to select the first 9 labels by using number keys 1-9.
@@ -131,17 +133,18 @@ For bounding boxes, important questions include:
131133

132134
The **ML assisted labeling** page lets you trigger automatic machine learning models to accelerate the labeling task. This feature is available for image classification (multi-class or multi-label) tasks.
133135

134-
Select *Enable ML assisted labeling* and specify a GPU to enable the two phases of assisted labeling:
136+
Select *Enable ML assisted labeling* and specify a GPU to enable assisted labeling, which consists of two phases:
135137

136-
* **Clustering** - after a certain number of labels are submitted, the machine learning model starts to group together similar images. These similar images are presented to the labelers on the same screen to speed up manual tagging. Clustering is most useful when the labeler is viewing multiple images.
138+
* **Clustering** - after a certain number of labels are submitted, the machine learning model starts to group together similar images. These similar images are presented to the labelers on the same screen to speed up manual tagging. Clustering is especially useful when the labeler is viewing a grid of 4, 6, or 9 images.
137139

138140
* **Prelabeling** - after more image labels are submitted, a classification model is used to predict image tags. The labeler now sees pages that contain predicted labels already present on each image. The task is then to review these labels and correct any mis-labeled images before submitting the page.
139141

140-
The exact number of labeled images necessary to start assisted labeling is not a fixed number. The first clustering model is built once there are 350 labeled images, and the first classification model will build with 900 labeled images. But the models aren't used until an accuracy threshold is also met.
142+
The exact number of labeled images necessary to start assisted labeling is not a fixed number. The actual value depends on the number of label classes defined in your project. Labeling service will start to train a model when there are enough labels and use the model to produce either a clustered or prelabeled task.
141143

142144
Since the final labels still rely on input from the labeler, this technology is sometimes called *human in the loop* labeling.
143145

144-
This feature is available only in Enterprise edition workspaces.
146+
> [!NOTE]
147+
> ML assisted labeling is available **only** in Enterprise edition workspaces.
145148
146149
## Initialize the labeling project
147150

@@ -169,7 +172,7 @@ To pause or restart the project, select the **Pause**/**Start** button. You can
169172

170173
You can label data directly from the **Project details** page by selecting **Label data**.
171174

172-
## Add labels to a project
175+
## Add new label class to a project
173176

174177
During the labeling process, you may find that additional labels are needed to classify your images. For example, you may want to add an "Unknown" or "Other" label to indicate confusing images.
175178

0 commit comments

Comments
 (0)