Commit 1f92d23

Merge pull request #218472 from sdgilley/sdg-labeling
consensus labeling
2 parents f055166 + dc01afc commit 1f92d23

8 files changed

+72
-4
lines changed

articles/machine-learning/how-to-create-image-labeling-projects.md

Lines changed: 29 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -120,9 +120,16 @@ For bounding boxes, important questions include:
120120
* How to label the object if there is no clear boundary of the object?
121121
* How to label an object that is not in the object class of interest but is visually similar to an object type of interest?
122122

123-
>[!NOTE]
123+
> [!NOTE]
124124
> Be sure to note that the labelers will be able to select the first 9 labels by using number keys 1-9.
125125
126+
## Quality control (preview)
127+
128+
[!INCLUDE [describe](../../includes/machine-learning-data-labeling-quality-control.md)]
129+
130+
> [!NOTE]
131+
> **Instance Segmentation** projects cannot use consensus labeling.
132+
126133
## Use ML-assisted data labeling
127134

128135
The **ML-assisted labeling** page lets you trigger automatic machine learning models to accelerate labeling tasks. Medical images (".dcm") are not included in assisted labeling.
@@ -131,14 +138,15 @@ At the beginning of your labeling project, the items are shuffled into a random
131138

132139
Select *Enable ML assisted labeling* and specify a GPU to enable assisted labeling. If you don't have one in your workspace, a GPU cluster will be created for you and added to your workspace. The cluster is created with a minimum of 0 nodes, which means it doesn't cost anything when it's not in use.
133140

134-
135141
ML-assisted labeling consists of two phases:
136142

137143
* Clustering
138144
* Prelabeling
139145

140146
The exact number of labeled items necessary to start assisted labeling is not a fixed number. This can vary significantly from one labeling project to another. For some projects, it's sometimes possible to see prelabel or cluster tasks after 300 items have been manually labeled. ML Assisted Labeling uses a technique called *Transfer Learning*, which uses a pre-trained model to jump-start the training process. If your dataset's classes are similar to those in the pre-trained model, pre-labels may be available after only a few hundred manually labeled items. If your dataset is significantly different from the data used to pre-train the model, it may take much longer.
141147

148+
When you're using consensus labeling, the consensus label is used for training.
149+
142150
Since the final labels still rely on input from the labeler, this technology is sometimes called *human in the loop* labeling.
143151

144152
> [!NOTE]
@@ -192,6 +200,24 @@ On the right side is a distribution of the labels for those tasks that are compl
192200

193201
On the **Data** tab, you can see your dataset and review labeled data. Scroll through the labeled data to see the labels. If you see incorrectly labeled data, select it and choose **Reject**, which will remove the labels and put the data back into the unlabeled queue.
194202

203+
If your project uses consensus labeling, you'll also want to review those images without a consensus. To do so:
204+
205+
1. Select the **Data** tab.
206+
1. On the left, select **Review labels**.
207+
1. On the top right, select **All filters**.
208+
209+
:::image type="content" source="media/how-to-create-labeling-projects/select-filters.png" alt-text="Screenshot: select filters to review consensus label problems." lightbox="media/how-to-create-labeling-projects/select-filters.png":::
210+
211+
1. Under **Labeled datapoints**, select **Consensus labels in need of review**. This shows only those images where a consensus was not achieved among the labelers.
212+
213+
:::image type="content" source="media/how-to-create-labeling-projects/select-need-review.png" alt-text="Screenshot: Select labels in need of review.":::
214+
215+
1. For each image in need of review, select the **Consensus label** dropdown to view the conflicting labels.
216+
217+
:::image type="content" source="media/how-to-create-labeling-projects/consensus-dropdown.png" alt-text="Screenshot: Select Consensus label dropdown to review conflicting labels." lightbox="media/how-to-create-labeling-projects/consensus-dropdown.png":::
218+
219+
1. While you can select an individual to see just their label(s), you can only update or reject the labels from the top choice, **Consensus label (preview)**.
220+
195221
### Details tab
196222

197223
View and change details of your project. In this tab you can:
@@ -213,7 +239,7 @@ View and change details of your project. In this tab you can:
213239

214240
## Export the labels
215241

216-
Use the **Export** button on the **Project details** page of your labeling project. You can export the label data for Machine Learning experimentation at any time.
242+
Use the **Export** button on the **Project details** page of your labeling project. You can export the label data for Machine Learning experimentation at any time.
217243

218244
* Image labels can be exported as:
219245
* [COCO format](http://cocodataset.org/#format-data). The COCO file is created in the default blob store of the Azure Machine Learning workspace in a folder within *Labeling/export/coco*.

articles/machine-learning/how-to-create-text-labeling-projects.md

Lines changed: 25 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -125,6 +125,10 @@ To directly upload your data:
125125
>[!NOTE]
126126
> Be sure to note that the labelers will be able to select the first 9 labels by using number keys 1-9.
127127
128+
## Quality control (preview)
129+
130+
[!INCLUDE [describe](../../includes/machine-learning-data-labeling-quality-control.md)]
131+
128132
## Use ML-assisted data labeling
129133

130134
The **ML-assisted labeling** page lets you trigger automatic machine learning models to accelerate labeling tasks. ML-assisted labeling is available for both file (.txt) and tabular (.csv) text data inputs.
@@ -140,7 +144,9 @@ At the beginning of your labeling project, the items are shuffled into a random
140144

141145
For training the text DNN model used by ML-assist, the input text per training example will be limited to approximately the first 128 words in the document. For tabular input, all text columns are first concatenated before applying this limit. This is a practical limit imposed to allow for the model training to complete in a timely manner. The actual text in a document (for file input) or set of text columns (for tabular input) can exceed 128 words. The limit only pertains to what is internally leveraged by the model during the training process.
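
The truncation described above can be sketched as follows. This is a minimal illustration, not the service's implementation: the function name `training_text`, the whitespace-based word splitting, and the single-space join between columns are all assumptions, since the text only says the limit is "approximately" the first 128 words.

```python
MAX_WORDS = 128  # limit stated in the text; the real cutoff is only approximate


def training_text(columns: list[str], max_words: int = MAX_WORDS) -> str:
    """Concatenate text columns, then keep roughly the first `max_words` words.

    ASSUMPTION: words are split on whitespace and columns are joined with a
    single space. For file input, pass the document as a one-element list.
    """
    combined = " ".join(columns)          # tabular input: concatenate first
    words = combined.split()              # hypothetical tokenization
    return " ".join(words[:max_words])    # apply the limit after concatenation


if __name__ == "__main__":
    row = ["word " * 100, "more " * 100]  # two text columns, 200 words total
    print(len(training_text(row).split()))
```

Under these assumptions, a row whose columns total 200 words contributes only its first 128 words to training; the stored data itself is unchanged.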
142146

143-
The exact number of labeled items necessary to start assisted labeling isn't a fixed number. This can vary significantly from one labeling project to another, depending on many factors, including the number of labels classes and label distribution.
147+
The exact number of labeled items necessary to start assisted labeling isn't a fixed number. This can vary significantly from one labeling project to another, depending on many factors, including the number of label classes and label distribution.
148+
149+
When you're using consensus labeling, the consensus label is used for training.
144150

145151
Since the final labels still rely on input from the labeler, this technology is sometimes called *human in the loop* labeling.
146152

@@ -179,6 +185,24 @@ On the right side is a distribution of the labels for those tasks that are compl
179185

180186
On the **Data** tab, you can see your dataset and review labeled data. Scroll through the labeled data to see the labels. If you see incorrectly labeled data, select it and choose **Reject**, which will remove the labels and put the data back into the unlabeled queue.
181187

188+
If your project uses consensus labeling, you'll also want to review those items without a consensus. To do so:
189+
190+
1. Select the **Data** tab.
191+
1. On the left, select **Review labels**.
192+
1. On the top right, select **All filters**.
193+
194+
:::image type="content" source="media/how-to-create-text-labeling-projects/text-labeling-select-filter.png" alt-text="Screenshot: select filters to review consensus label problems." lightbox="media/how-to-create-text-labeling-projects/text-labeling-select-filter.png":::
195+
196+
1. Under **Labeled datapoints**, select **Consensus labels in need of review**. This shows only those items where a consensus was not achieved among the labelers.
197+
198+
:::image type="content" source="media/how-to-create-labeling-projects/select-need-review.png" alt-text="Screenshot: Select labels in need of review.":::
199+
200+
1. For each item in need of review, select the **Consensus label** dropdown to view the conflicting labels.
201+
202+
:::image type="content" source="media/how-to-create-text-labeling-projects/text-labeling-consensus-dropdown.png" alt-text="Screenshot: Select Consensus label dropdown to review conflicting labels." lightbox="media/how-to-create-text-labeling-projects/text-labeling-consensus-dropdown.png":::
203+
204+
1. While you can select an individual to see just their label(s), you can only update or reject the labels from the top choice, **Consensus label (preview)**.
205+
182206
### Details tab
183207

184208
View and change details of your project. In this tab you can:
includes/machine-learning-data-labeling-quality-control.md

Lines changed: 18 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,18 @@
1+
---
2+
author: sgilley
3+
ms.service: machine-learning
4+
ms.topic: include
5+
ms.date: 10/21/2021
6+
ms.author: sdgilley
7+
---
8+
9+
To get more accurate labels, use the **Quality control** page to send each item to multiple labelers.
10+
11+
> [!IMPORTANT]
12+
> Consensus labeling is currently in public preview.
13+
> The preview version is provided without a service level agreement, and it's not recommended for production workloads. Certain features might not be supported or might have constrained capabilities.
14+
> For more information, see [Supplemental Terms of Use for Microsoft Azure Previews](https://azure.microsoft.com/support/legal/preview-supplemental-terms/).
15+
16+
Select **Enable consensus labeling (preview)** to have each item sent to multiple labelers. Then set the **Minimum labelers** and **Maximum labelers** to specify how many labelers to use. Make sure you have as many labelers available as your maximum number. You can't change these settings after the project has started.
17+
18+
If a consensus is reached from the minimum number of labelers, the item is labeled. If a consensus isn't reached, the item will be sent to more labelers. If there's no consensus after the item goes to the maximum number of labelers, its status will be `Needs Review`, and the project owner will be responsible for labeling the item.
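
The escalation flow above can be sketched in Python. This is a hedged illustration only: the text does not say how agreement is measured, so the strict-majority rule in `find_consensus` is an assumption, and the function names and the `"Labeled"` status string are hypothetical. Only `Needs Review` and the minimum/maximum labeler settings come from the text.

```python
from collections import Counter
from typing import Optional


def find_consensus(labels: list[str], threshold: float = 0.5) -> Optional[str]:
    """Return the most common label if its vote share exceeds `threshold`.

    ASSUMPTION: consensus means a strict-majority vote share. The service's
    actual agreement rule is not described in the text.
    """
    if not labels:
        return None
    label, count = Counter(labels).most_common(1)[0]
    return label if count / len(labels) > threshold else None


def route_item(
    votes: list[str], min_labelers: int, max_labelers: int
) -> tuple[str, Optional[str]]:
    """Simulate one item; `votes` lists labels in the order labelers submit them."""
    if not 1 <= min_labelers <= max_labelers:
        raise ValueError("need 1 <= min_labelers <= max_labelers")
    if len(votes) < max_labelers:
        raise ValueError("need as many labelers available as max_labelers")
    # Start with the minimum number of labelers, then add one at a time.
    for n in range(min_labelers, max_labelers + 1):
        label = find_consensus(votes[:n])
        if label is not None:
            return ("Labeled", label)  # hypothetical status name
    # No agreement even at the maximum: the project owner must label the item.
    return ("Needs Review", None)


if __name__ == "__main__":
    print(route_item(["cat", "dog", "cat"], min_labelers=2, max_labelers=3))
    print(route_item(["cat", "dog", "bird"], min_labelers=2, max_labelers=3))
```

With a minimum of 2 and a maximum of 3, the first item is undecided after two labelers, goes to a third, and is then labeled `cat`; the second never reaches agreement and ends up as `Needs Review`.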
