Skip to content

Commit 3ce987e

Browse files
authored
Merge pull request #178422 from sdgilley/sdg-update-dl
minor review changes
2 parents 75d134d + abb75c4 commit 3ce987e

File tree

2 files changed

+12
-7
lines changed

2 files changed

+12
-7
lines changed

articles/machine-learning/how-to-create-text-labeling-projects.md

Lines changed: 10 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -34,10 +34,10 @@ Azure Machine Learning data labeling is a central place to create, manage, and m
3434
> [!Important]
3535
> Text data must be available in an Azure blob datastore. (If you do not have an existing datastore, you may upload files during project creation.)
3636
37-
Text data can be either ".txt" or ".csv" files.
37+
Data formats available for text data:
3838

39-
* For ".txt" files, each file represents one item to be labeled.
40-
* For ".csv" files, each row represents one item presented to the labeler. You can display one or more columns to use when labeling that row.
39+
* **.txt**: each file represents one item to be labeled.
40+
* **.csv** or **.tsv**: each row represents one item presented to the labeler. You decide which columns the labeler can see in order to label the row.
4141

4242
## Prerequisites
4343

@@ -78,7 +78,7 @@ To create a dataset from data that you've already stored in Azure Blob storage:
7878
1. Select **Create a dataset** > **From datastore**.
7979
1. Assign a **Name** to your dataset.
8080
1. Choose the **Dataset type**:
81-
* Select **Tabular** if you're using a .csv file, where each row contains a response.
81+
* Select **Tabular** if you're using a .csv or .tsv file, where each row contains a response.
8282
* Select **File** if you're using separate .txt files for each response.
8383
1. (Optional) Provide a description for your dataset.
8484
1. Select **Next**.
@@ -96,16 +96,16 @@ To directly upload your data:
9696
1. Select **Create a dataset** > **From local files**.
9797
1. Assign a **Name** to your dataset.
9898
1. Choose the **Dataset type**.
99-
* Select **Tabular** if you're using a .csv file, where each row is a response.
99+
* Select **Tabular** if you're using a .csv or .tsv file, where each row is a response.
100100
* Select **File** if you're using separate .txt files for each response.
101101
1. (Optional) Provide a description of your dataset.
102102
1. Select **Next**
103103
1. (Optional) Select or create a datastore. Or keep the default to upload to the default blob store ("workspaceblobstore") of your Machine Learning workspace.
104104
1. Select **Upload** to select the local file(s) or folder(s) to upload.
105105
1. Select **Next**.
106-
1. If uploading .csv files:
106+
1. If uploading .csv or .tsv files:
107107
* Confirm the settings and preview, select **Next**.
108-
* Include all columns of text you'd like the labeler to see when classifying that row.
108+
* Include all columns of text you'd like the labeler to see when classifying that row. If you'll be using ML assisted labeling, adding numeric columns may degrade the ML assist model.
109109
* Select **Next**.
110110
1. Confirm the details. Select **Back** to modify the settings or **Create** to create the dataset.
111111

@@ -114,6 +114,9 @@ To directly upload your data:
114114

115115
[!INCLUDE [refresh](../../includes/machine-learning-data-labeling-refresh.md)]
116116

117+
> [!NOTE]
118+
> Incremental refresh isn't available for projects that use tabular (.csv or .tsv) dataset input.
119+
117120
## Specify label classes
118121

119122
[!INCLUDE [classes](../../includes/machine-learning-data-labeling-classes.md)]

articles/machine-learning/how-to-label-data.md

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -63,6 +63,8 @@ Machine learning algorithms may be triggered during your labeling. If these algo
6363

6464
Especially early in a labeling project, the machine learning model may only be accurate enough to prelabel a small subset of images. Once these images are labeled, the labeling project will return to manual labeling to gather more data for the next round of model training. Over time, the model will become more confident about a higher proportion of images, resulting in more prelabel tasks later in the project.
6565

66+
When there are no more prelabled tasks, you'll stop confirming or correcting labels and go back to manually tagging the items.
67+
6668
## <a name="image-tasks"></a> Image tasks
6769

6870
For image-classification tasks, you can choose to view multiple images simultaneously. Use the icons above the image area to select the layout.

0 commit comments

Comments
 (0)