Skip to content

Commit cf93a0c

Browse files
committed
edits
1 parent 1825962 commit cf93a0c

5 files changed

+23
-23
lines changed

articles/machine-learning/how-to-create-image-labeling-projects.md

Lines changed: 10 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -14,7 +14,7 @@ ms.custom: data4ml, ignite-fall-2021, ignite-2022
1414

1515
# Set up an image labeling project and export labels
1616

17-
Learn how to create and run data labeling projects to label images in Azure Machine Learning. Use machine learning-assisted data labeling or human-in-the-loop labeling to help with the task.
17+
Learn how to create and run data labeling projects to label images in Azure Machine Learning. Use machine learning (ML)-assisted data labeling or human-in-the-loop labeling to help with the task.
1818

1919
Set up labels for classification, object detection (bounding box), or instance segmentation (polygon).
2020

@@ -157,27 +157,27 @@ For bounding boxes, important questions include:
157157
> [!NOTE]
158158
> **Instance Segmentation** projects can't use consensus labeling.
159159
160-
## Use machine learning-assisted data labeling
160+
## Use ML-assisted data labeling
161161

162162
To accelerate labeling tasks, on the **ML assisted labeling** page, you can trigger automatic machine learning models. Medical images (files that have a *.dcm* extension) aren't included in assisted labeling.
163163

164164
At the start of your labeling project, the items are shuffled into a random order to reduce potential bias. However, the trained model reflects any biases that are present in the dataset. For example, if 80 percent of your items are of a single class, then approximately 80 percent of the data used to train the model lands in that class.
165165

166166
To enable assisted labeling, select **Enable ML assisted labeling** and specify a GPU. If you don't have a GPU in your workspace, a GPU cluster is created for you and added to your workspace. The cluster is created with a minimum of zero nodes, which means it costs nothing when not in use.
167167

168-
Machine learning-assisted labeling consists of two phases:
168+
ML-assisted labeling consists of two phases:
169169

170170
* Clustering
171171
* Pre-labeling
172172

173-
The labeled data item count that's required to start assisted labeling isn't a fixed number. This number can vary significantly from one labeling project to another. For some projects, it's sometimes possible to see pre-label or cluster tasks after 300 items have been manually labeled. Machine learning-assisted labeling uses a technique called *transfer learning*. Transfer learning uses a pre-trained model to jump-start the training process. If the classes of your dataset resemble the classes in the pre-trained model, pre-labels might become available after only a few hundred manually labeled items. If your dataset significantly differs from the data that's used to pre-train the model, the process might take more time.
173+
The labeled data item count that's required to start assisted labeling isn't a fixed number. This number can vary significantly from one labeling project to another. For some projects, it's sometimes possible to see pre-label or cluster tasks after 300 items have been manually labeled. ML-assisted labeling uses a technique called *transfer learning*. Transfer learning uses a pre-trained model to jump-start the training process. If the classes of your dataset resemble the classes in the pre-trained model, pre-labels might become available after only a few hundred manually labeled items. If your dataset significantly differs from the data that's used to pre-train the model, the process might take more time.
174174

175175
When you use consensus labeling, the consensus label is used for training.
176176

177177
Because the final labels still rely on input from the labeler, this technology is sometimes called *human-in-the-loop* labeling.
178178

179179
> [!NOTE]
180-
> Machine learning-assisted data labeling doesn't support default storage accounts that are secured behind a [virtual network](how-to-network-security-overview.md). You must use a non-default storage account for machine learning-assisted data labeling. The non-default storage account can be secured behind the virtual network.
180+
> ML-assisted data labeling doesn't support default storage accounts that are secured behind a [virtual network](how-to-network-security-overview.md). You must use a non-default storage account for ML-assisted data labeling. The non-default storage account can be secured behind the virtual network.
181181
182182
### Clustering
183183

@@ -213,14 +213,14 @@ A distribution of the labels for completed tasks is shown below the chart. In so
213213

214214
A distribution of labelers and how many items they've labeled also are shown.
215215

216-
The middle section shows a table that has a queue of unassigned tasks. When machine learning-assisted labeling is off, this section shows the number of manual tasks that are awaiting assignment.
216+
The middle section shows a table that has a queue of unassigned tasks. When ML-assisted labeling is off, this section shows the number of manual tasks that are awaiting assignment.
217217

218-
When machine learning-assisted labeling is on, this section also shows:
218+
When ML-assisted labeling is on, this section also shows:
219219

220220
* Tasks that contain clustered items in the queue.
221221
* Tasks that contain pre-labeled items in the queue.
222222

223-
Additionally, when machine learning-assisted labeling is enabled, you can scroll down to see the machine learning-assisted labeling status. The **Jobs** sections give links for each of the machine learning runs.
223+
Additionally, when ML-assisted labeling is enabled, you can scroll down to see the ML-assisted labeling status. The **Jobs** sections give links for each of the machine learning runs.
224224

225225
* **Training**: Trains a model to predict the labels.
226226
* **Validation**: Determines whether item pre-labeling uses the prediction of this model.
@@ -258,7 +258,7 @@ View and change details of your project. On this tab, you can:
258258
* View details of the storage container that's used to store labeled outputs in your project.
259259
* Add labels to your project.
260260
* Edit instructions you give to your labels.
261-
* Change settings for machine learning-assisted labeling and kick off a labeling task.
261+
* Change settings for ML-assisted labeling and kick off a labeling task.
262262

263263
### Access for labelers
264264

@@ -268,7 +268,7 @@ View and change details of your project. On this tab, you can:
268268

269269
[!INCLUDE [add-label](../../includes/machine-learning-data-labeling-add-label.md)]
270270

271-
## Start a machine learning-assisted labeling task
271+
## Start an ML-assisted labeling task
272272

273273
[!INCLUDE [start-ml-assist](../../includes/machine-learning-data-labeling-start-ml-assist.md)]
274274

articles/machine-learning/how-to-create-text-labeling-projects.md

Lines changed: 9 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -139,9 +139,9 @@ To directly upload your data:
139139

140140
## Use ML-assisted data labeling
141141

142-
To accelerate labeling tasks, the **ML assisted labeling** page can trigger automatic machine learning models. Machine learning-assisted labeling can handle both file (*.txt*) and tabular (*.csv*) text data inputs.
142+
To accelerate labeling tasks, the **ML assisted labeling** page can trigger automatic machine learning models. Machine learning (ML)-assisted labeling can handle both file (*.txt*) and tabular (*.csv*) text data inputs.
143143

144-
To use machine learning-assisted labeling:
144+
To use ML-assisted labeling:
145145

146146
1. Select **Enable ML assisted labeling**.
147147
1. Select the **Dataset language** for the project. This list shows all languages that the [TextDNNLanguages Class](/python/api/azureml-automl-core/azureml.automl.core.constants.textdnnlanguages?view=azure-ml-py&preserve-view=true) supports.
@@ -151,7 +151,7 @@ To use machine learning-assisted labeling:
151151

152152
At the start of your labeling project, the items are shuffled into a random order to reduce potential bias. However, the trained model reflects any biases present in the dataset. For example, if 80 percent of your items are of a single class, then approximately 80 percent of the data that's used to train the model lands in that class.
153153

154-
To train the text DNN model that machine learning-assisted labeling uses, the input text per training example is limited to approximately the first 128 words in the document. For tabular input, all text columns are concatenated before this limit is applied. This practical limit allows the model training to complete in a reasonable amount of time. The actual text in a document (for file input) or set of text columns (for tabular input) can exceed 128 words. The limit pertains only to what the model internally uses during the training process.
154+
To train the text DNN model that ML-assisted labeling uses, the input text per training example is limited to approximately the first 128 words in the document. For tabular input, all text columns are concatenated before this limit is applied. This practical limit allows the model training to complete in a reasonable amount of time. The actual text in a document (for file input) or set of text columns (for tabular input) can exceed 128 words. The limit pertains only to what the model internally uses during the training process.
155155

156156
The number of labeled items that's required to start assisted labeling isn't a fixed number. This number can vary significantly from one labeling project to another. The variance depends on many factors, including the number of label classes and the label distribution.
157157

@@ -160,7 +160,7 @@ When you use consensus labeling, the consensus label is used for training.
160160
Because the final labels still rely on input from the labeler, this technology is sometimes called *human-in-the-loop* labeling.
161161

162162
> [!NOTE]
163-
> Machine learning-assisted data labeling doesn't support default storage accounts that are secured behind a [virtual network](how-to-network-security-overview.md). You must use a non-default storage account for machine learning-assisted data labeling. The non-default storage account can be secured behind the virtual network.
163+
> ML-assisted data labeling doesn't support default storage accounts that are secured behind a [virtual network](how-to-network-security-overview.md). You must use a non-default storage account for ML-assisted data labeling. The non-default storage account can be secured behind the virtual network.
164164
165165
### Pre-labeling
166166

@@ -188,14 +188,14 @@ A distribution of the labels for completed tasks is shown below the chart. In so
188188

189189
A distribution of labelers and how many items they've labeled also are shown.
190190

191-
The middle section shows a table that has a queue of unassigned tasks. When machine learning-assisted labeling is off, this section shows the number of manual tasks that are awaiting assignment.
191+
The middle section shows a table that has a queue of unassigned tasks. When ML-assisted labeling is off, this section shows the number of manual tasks that are awaiting assignment.
192192

193-
When machine learning-assisted labeling is on, this section also shows:
193+
When ML-assisted labeling is on, this section also shows:
194194

195195
* Tasks that contain clustered items in the queue.
196196
* Tasks that contain pre-labeled items in the queue.
197197

198-
Additionally, when machine learning-assisted labeling is enabled, you can scroll down to see the machine learning-assisted labeling status. The **Jobs** sections give links for each of the machine learning runs.
198+
Additionally, when ML-assisted labeling is enabled, you can scroll down to see the ML-assisted labeling status. The **Jobs** sections give links for each of the machine learning runs.
199199

200200
### Data
201201

@@ -228,7 +228,7 @@ View and change details of your project. On this tab, you can:
228228
* View details of the storage container that's used to store labeled outputs in your project.
229229
* Add labels to your project.
230230
* Edit instructions you give to your labels.
231-
* Change settings for machine learning-assisted labeling and kick off a labeling task.
231+
* Change settings for ML-assisted labeling and kick off a labeling task.
232232

233233
### Access for labelers
234234

@@ -238,7 +238,7 @@ View and change details of your project. On this tab, you can:
238238

239239
[!INCLUDE [add-label](../../includes/machine-learning-data-labeling-add-label.md)]
240240

241-
## Start a machine learning-assisted labeling task
241+
## Start an ML-assisted labeling task
242242

243243
[!INCLUDE [start-ml-assist](../../includes/machine-learning-data-labeling-start-ml-assist.md)]
244244

articles/machine-learning/how-to-network-security-overview.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -184,7 +184,7 @@ To enable full studio functionality, see [Use Azure Machine Learning studio in a
184184

185185
### Limitations
186186

187-
[ML-assisted data labeling](how-to-create-image-labeling-projects.md#use-machine-learning-assisted-data-labeling) doesn't support a default storage account behind a virtual network. Instead, use a storage account other than the default for ML assisted data labeling.
187+
[ML-assisted data labeling](how-to-create-image-labeling-projects.md#use-ml-assisted-data-labeling) doesn't support a default storage account behind a virtual network. Instead, use a storage account other than the default for ML assisted data labeling.
188188

189189
> [!TIP]
190190
> As long as it is not the default storage account, the account used by data labeling can be secured behind the virtual network.

articles/machine-learning/v1/how-to-network-security-overview.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -199,7 +199,7 @@ To enable full studio functionality, see [Use Azure Machine Learning studio in a
199199

200200
### Limitations
201201

202-
[ML-assisted data labeling](../how-to-create-image-labeling-projects.md#use-machine-learning-assisted-data-labeling) doesn't support a default storage account behind a virtual network. Instead, use a storage account other than the default for ML assisted data labeling.
202+
[ML-assisted data labeling](../how-to-create-image-labeling-projects.md#use-ml-assisted-data-labeling) doesn't support a default storage account behind a virtual network. Instead, use a storage account other than the default for ML assisted data labeling.
203203

204204
> [!TIP]
205205
> As long as it is not the default storage account, the account used by data labeling can be secured behind the virtual network.

includes/machine-learning-data-labeling-start-ml-assist.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -6,12 +6,12 @@ ms.date: 01/27/2023
66
ms.author: sdgilley
77
---
88

9-
Machine learning-assisted labeling starts automatically after some items have been labeled. This automatic threshold varies by project. You can manually start a machine learning-assisted training run if your project contains at least some labeled data.
9+
ML-assisted labeling starts automatically after some items have been labeled. This automatic threshold varies by project. You can manually start an ML-assisted training run if your project contains at least some labeled data.
1010

1111
> [!NOTE]
1212
> On-demand training is not available for projects created before December 2022. To use this feature, create a new project.
1313
14-
To start a new machine learning-assisted training run:
14+
To start a new ML-assisted training run:
1515

1616
1. At the top of your project, select **Details**.
1717
1. On the left menu, select **ML assisted labeling**.

0 commit comments

Comments
 (0)