You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
# Set up an image labeling project and export labels
16
16
17
-
Learn how to create and run data labeling projects to label images in Azure Machine Learning. Use machine learning-assisted data labeling or human-in-the-loop labeling to help with the task.
17
+
Learn how to create and run data labeling projects to label images in Azure Machine Learning. Use machine learning (ML)-assisted data labeling or human-in-the-loop labeling to help with the task.
18
18
19
19
Set up labels for classification, object detection (bounding box), or instance segmentation (polygon).
20
20
@@ -157,27 +157,27 @@ For bounding boxes, important questions include:
157
157
> [!NOTE]
158
158
> **Instance Segmentation** projects can't use consensus labeling.
159
159
160
-
## Use machine learning-assisted data labeling
160
+
## Use ML-assisted data labeling
161
161
162
162
To accelerate labeling tasks, on the **ML assisted labeling** page, you can trigger automatic machine learning models. Medical images (files that have a *.dcm* extension) aren't included in assisted labeling.
163
163
164
164
At the start of your labeling project, the items are shuffled into a random order to reduce potential bias. However, the trained model reflects any biases that are present in the dataset. For example, if 80 percent of your items are of a single class, then approximately 80 percent of the data used to train the model lands in that class.
165
165
166
166
To enable assisted labeling, select **Enable ML assisted labeling** and specify a GPU. If you don't have a GPU in your workspace, a GPU cluster is created for you and added to your workspace. The cluster is created with a minimum of zero nodes, which means it costs nothing when not in use.
167
167
168
-
Machine learning-assisted labeling consists of two phases:
168
+
ML-assisted labeling consists of two phases:
169
169
170
170
* Clustering
171
171
* Pre-labeling
172
172
173
-
The labeled data item count that's required to start assisted labeling isn't a fixed number. This number can vary significantly from one labeling project to another. For some projects, it's sometimes possible to see pre-label or cluster tasks after 300 items have been manually labeled. Machine learning-assisted labeling uses a technique called *transfer learning*. Transfer learning uses a pre-trained model to jump-start the training process. If the classes of your dataset resemble the classes in the pre-trained model, pre-labels might become available after only a few hundred manually labeled items. If your dataset significantly differs from the data that's used to pre-train the model, the process might take more time.
173
+
The labeled data item count that's required to start assisted labeling isn't a fixed number. This number can vary significantly from one labeling project to another. For some projects, it's sometimes possible to see pre-label or cluster tasks after 300 items have been manually labeled. ML-assisted labeling uses a technique called *transfer learning*. Transfer learning uses a pre-trained model to jump-start the training process. If the classes of your dataset resemble the classes in the pre-trained model, pre-labels might become available after only a few hundred manually labeled items. If your dataset significantly differs from the data that's used to pre-train the model, the process might take more time.
174
174
175
175
When you use consensus labeling, the consensus label is used for training.
176
176
177
177
Because the final labels still rely on input from the labeler, this technology is sometimes called *human-in-the-loop* labeling.
178
178
179
179
> [!NOTE]
180
-
> Machine learning-assisted data labeling doesn't support default storage accounts that are secured behind a [virtual network](how-to-network-security-overview.md). You must use a non-default storage account for machine learning-assisted data labeling. The non-default storage account can be secured behind the virtual network.
180
+
> ML-assisted data labeling doesn't support default storage accounts that are secured behind a [virtual network](how-to-network-security-overview.md). You must use a non-default storage account for ML-assisted data labeling. The non-default storage account can be secured behind the virtual network.
181
181
182
182
### Clustering
183
183
@@ -213,14 +213,14 @@ A distribution of the labels for completed tasks is shown below the chart. In so
213
213
214
214
A distribution of labelers and how many items they've labeled also are shown.
215
215
216
-
The middle section shows a table that has a queue of unassigned tasks. When machine learning-assisted labeling is off, this section shows the number of manual tasks that are awaiting assignment.
216
+
The middle section shows a table that has a queue of unassigned tasks. When ML-assisted labeling is off, this section shows the number of manual tasks that are awaiting assignment.
217
217
218
-
When machine learning-assisted labeling is on, this section also shows:
218
+
When ML-assisted labeling is on, this section also shows:
219
219
220
220
* Tasks that contain clustered items in the queue.
221
221
* Tasks that contain pre-labeled items in the queue.
222
222
223
-
Additionally, when machine learning-assisted labeling is enabled, you can scroll down to see the machine learning-assisted labeling status. The **Jobs** sections give links for each of the machine learning runs.
223
+
Additionally, when ML-assisted labeling is enabled, you can scroll down to see the ML-assisted labeling status. The **Jobs** sections give links for each of the machine learning runs.
224
224
225
225
***Training**: Trains a model to predict the labels.
226
226
***Validation**: Determines whether item pre-labeling uses the prediction of this model.
@@ -258,7 +258,7 @@ View and change details of your project. On this tab, you can:
258
258
* View details of the storage container that's used to store labeled outputs in your project.
259
259
* Add labels to your project.
260
260
* Edit instructions you give to your labels.
261
-
* Change settings for machine learning-assisted labeling and kick off a labeling task.
261
+
* Change settings for ML-assisted labeling and kick off a labeling task.
262
262
263
263
### Access for labelers
264
264
@@ -268,7 +268,7 @@ View and change details of your project. On this tab, you can:
Copy file name to clipboardExpand all lines: articles/machine-learning/how-to-create-text-labeling-projects.md
+9-9Lines changed: 9 additions & 9 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -139,9 +139,9 @@ To directly upload your data:
139
139
140
140
## Use ML-assisted data labeling
141
141
142
-
To accelerate labeling tasks, the **ML assisted labeling** page can trigger automatic machine learning models. Machine learning-assisted labeling can handle both file (*.txt*) and tabular (*.csv*) text data inputs.
142
+
To accelerate labeling tasks, the **ML assisted labeling** page can trigger automatic machine learning models. Machine learning (ML)-assisted labeling can handle both file (*.txt*) and tabular (*.csv*) text data inputs.
143
143
144
-
To use machine learning-assisted labeling:
144
+
To use ML-assisted labeling:
145
145
146
146
1. Select **Enable ML assisted labeling**.
147
147
1. Select the **Dataset language** for the project. This list shows all languages that the [TextDNNLanguages Class](/python/api/azureml-automl-core/azureml.automl.core.constants.textdnnlanguages?view=azure-ml-py&preserve-view=true) supports.
@@ -151,7 +151,7 @@ To use machine learning-assisted labeling:
151
151
152
152
At the start of your labeling project, the items are shuffled into a random order to reduce potential bias. However, the trained model reflects any biases present in the dataset. For example, if 80 percent of your items are of a single class, then approximately 80 percent of the data that's used to train the model lands in that class.
153
153
154
-
To train the text DNN model that machine learning-assisted labeling uses, the input text per training example is limited to approximately the first 128 words in the document. For tabular input, all text columns are concatenated before this limit is applied. This practical limit allows the model training to complete in a reasonable amount of time. The actual text in a document (for file input) or set of text columns (for tabular input) can exceed 128 words. The limit pertains only to what the model internally uses during the training process.
154
+
To train the text DNN model that ML-assisted labeling uses, the input text per training example is limited to approximately the first 128 words in the document. For tabular input, all text columns are concatenated before this limit is applied. This practical limit allows the model training to complete in a reasonable amount of time. The actual text in a document (for file input) or set of text columns (for tabular input) can exceed 128 words. The limit pertains only to what the model internally uses during the training process.
155
155
156
156
The number of labeled items that's required to start assisted labeling isn't a fixed number. This number can vary significantly from one labeling project to another. The variance depends on many factors, including the number of label classes and the label distribution.
157
157
@@ -160,7 +160,7 @@ When you use consensus labeling, the consensus label is used for training.
160
160
Because the final labels still rely on input from the labeler, this technology is sometimes called *human-in-the-loop* labeling.
161
161
162
162
> [!NOTE]
163
-
> Machine learning-assisted data labeling doesn't support default storage accounts that are secured behind a [virtual network](how-to-network-security-overview.md). You must use a non-default storage account for machine learning-assisted data labeling. The non-default storage account can be secured behind the virtual network.
163
+
> ML-assisted data labeling doesn't support default storage accounts that are secured behind a [virtual network](how-to-network-security-overview.md). You must use a non-default storage account for ML-assisted data labeling. The non-default storage account can be secured behind the virtual network.
164
164
165
165
### Pre-labeling
166
166
@@ -188,14 +188,14 @@ A distribution of the labels for completed tasks is shown below the chart. In so
188
188
189
189
A distribution of labelers and how many items they've labeled also are shown.
190
190
191
-
The middle section shows a table that has a queue of unassigned tasks. When machine learning-assisted labeling is off, this section shows the number of manual tasks that are awaiting assignment.
191
+
The middle section shows a table that has a queue of unassigned tasks. When ML-assisted labeling is off, this section shows the number of manual tasks that are awaiting assignment.
192
192
193
-
When machine learning-assisted labeling is on, this section also shows:
193
+
When ML-assisted labeling is on, this section also shows:
194
194
195
195
* Tasks that contain clustered items in the queue.
196
196
* Tasks that contain pre-labeled items in the queue.
197
197
198
-
Additionally, when machine learning-assisted labeling is enabled, you can scroll down to see the machine learning-assisted labeling status. The **Jobs** sections give links for each of the machine learning runs.
198
+
Additionally, when ML-assisted labeling is enabled, you can scroll down to see the ML-assisted labeling status. The **Jobs** sections give links for each of the machine learning runs.
199
199
200
200
### Data
201
201
@@ -228,7 +228,7 @@ View and change details of your project. On this tab, you can:
228
228
* View details of the storage container that's used to store labeled outputs in your project.
229
229
* Add labels to your project.
230
230
* Edit instructions you give to your labels.
231
-
* Change settings for machine learning-assisted labeling and kick off a labeling task.
231
+
* Change settings for ML-assisted labeling and kick off a labeling task.
232
232
233
233
### Access for labelers
234
234
@@ -238,7 +238,7 @@ View and change details of your project. On this tab, you can:
Copy file name to clipboardExpand all lines: articles/machine-learning/how-to-network-security-overview.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -184,7 +184,7 @@ To enable full studio functionality, see [Use Azure Machine Learning studio in a
184
184
185
185
### Limitations
186
186
187
-
[ML-assisted data labeling](how-to-create-image-labeling-projects.md#use-machine-learning-assisted-data-labeling) doesn't support a default storage account behind a virtual network. Instead, use a storage account other than the default for ML assisted data labeling.
187
+
[ML-assisted data labeling](how-to-create-image-labeling-projects.md#use-ml-assisted-data-labeling) doesn't support a default storage account behind a virtual network. Instead, use a storage account other than the default for ML assisted data labeling.
188
188
189
189
> [!TIP]
190
190
> As long as it is not the default storage account, the account used by data labeling can be secured behind the virtual network.
Copy file name to clipboardExpand all lines: articles/machine-learning/v1/how-to-network-security-overview.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -199,7 +199,7 @@ To enable full studio functionality, see [Use Azure Machine Learning studio in a
199
199
200
200
### Limitations
201
201
202
-
[ML-assisted data labeling](../how-to-create-image-labeling-projects.md#use-machine-learning-assisted-data-labeling) doesn't support a default storage account behind a virtual network. Instead, use a storage account other than the default for ML assisted data labeling.
202
+
[ML-assisted data labeling](../how-to-create-image-labeling-projects.md#use-ml-assisted-data-labeling) doesn't support a default storage account behind a virtual network. Instead, use a storage account other than the default for ML assisted data labeling.
203
203
204
204
> [!TIP]
205
205
> As long as it is not the default storage account, the account used by data labeling can be secured behind the virtual network.
Copy file name to clipboardExpand all lines: includes/machine-learning-data-labeling-start-ml-assist.md
+2-2Lines changed: 2 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -6,12 +6,12 @@ ms.date: 01/27/2023
6
6
ms.author: sdgilley
7
7
---
8
8
9
-
Machine learning-assisted labeling starts automatically after some items have been labeled. This automatic threshold varies by project. You can manually start a machine learning-assisted training run if your project contains at least some labeled data.
9
+
ML-assisted labeling starts automatically after some items have been labeled. This automatic threshold varies by project. You can manually start an ML-assisted training run if your project contains at least some labeled data.
10
10
11
11
> [!NOTE]
12
12
> On-demand training is not available for projects created before December 2022. To use this feature, create a new project.
13
13
14
-
To start a new machine learning-assisted training run:
14
+
To start a new ML-assisted training run:
15
15
16
16
1. At the top of your project, select **Details**.
17
17
1. On the left menu, select **ML assisted labeling**.
0 commit comments