Skip to content

Commit 07ef0e0

Browse files
committed
update text labeling
1 parent 060a03f commit 07ef0e0

File tree

3 files changed

+23
-21
lines changed

3 files changed

+23
-21
lines changed

articles/machine-learning/how-to-create-text-labeling-projects.md

Lines changed: 23 additions & 21 deletions
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@ ms.reviewer: sgilley
88
ms.service: machine-learning
99
ms.subservice: mldata
1010
ms.topic: how-to
11-
ms.date: 09/29/2022
11+
ms.date: 02/03/2023
1212
ms.custom: data4ml, ignite-fall-2021
1313
---
1414

@@ -72,39 +72,41 @@ In many cases, it's fine to just upload local files. But [Azure Storage Explorer
7272

7373
To create a dataset from data that you've already stored in Azure Blob storage:
7474

75-
1. Select **Create a dataset** > **From datastore**.
76-
1. Assign a **Name** to your dataset.
75+
1. Select **+ Create** .
76+
1. Assign a **Name** to your dataset, and optionally a description.
7777
1. Choose the **Dataset type**:
7878
* Select **Tabular** if you're using a .csv or .tsv file, where each row contains a response.
7979
* Select **File** if you're using separate .txt files for each response.
80-
1. (Optional) Provide a description for your dataset.
81-
1. Select **Next**.
82-
1. Select the datastore.
80+
1. Select **Next**
81+
1. Select **From Azure storage**, then **Next**.
82+
1. Select the datastore, then select **Next**.
8383
1. If your data is in a subfolder within your blob storage, choose **Browse** to select the path.
8484
* Append "/**" to the path to include all the files in subfolders of the selected path.
8585
* Append "**/*.*" to include all the data in the current container and its subfolders.
86-
1. Select **Next**.
87-
1. Confirm the details. Select **Back** to modify the settings or **Create** to create the dataset.
86+
1. Select **Create**.
87+
1. Now select the data asset you just created.
8888

8989
### Create a dataset from uploaded data
9090

9191
To directly upload your data:
9292

93-
1. Select **Create a dataset** > **From local files**.
94-
1. Assign a **Name** to your dataset.
95-
1. Choose the **Dataset type**.
96-
* Select **Tabular** if you're using a .csv or .tsv file, where each row is a response.
93+
1. Select **+ Create**.
94+
1. Assign a **Name** to your dataset, and optionally a description.
95+
1. Choose the **Dataset type**:
96+
* Select **Tabular** if you're using a .csv or .tsv file, where each row contains a response.
9797
* Select **File** if you're using separate .txt files for each response.
98-
1. (Optional) Provide a description of your dataset.
99-
1. Select **Next**
100-
1. (Optional) Select or create a datastore. Or keep the default to upload to the default blob store ("workspaceblobstore") of your Machine Learning workspace.
101-
1. Select **Upload** to select the local file(s) or folder(s) to upload.
10298
1. Select **Next**.
103-
1. If uploading .csv or .tsv files:
104-
* Confirm the settings and preview, select **Next**.
105-
* Include all columns of text you'd like the labeler to see when classifying that row. If you'll be using ML assisted labeling, adding numeric columns may degrade the ML assist model.
106-
* Select **Next**.
107-
1. Confirm the details. Select **Back** to modify the settings or **Create** to create the dataset.
99+
1. Select **From local files**, then select **Next**.
100+
1. (Optional) Select a datastore. Or keep the default to upload to the default blob store ("workspaceblobstore") of your Machine Learning workspace.
101+
1. Select **Next**.
102+
1. Select **Upload > Upload files** or **Upload > Upload folder** to select the local files or folder(s) to upload.
103+
1. In the browser window, find your files or folder, then select **Open**.
104+
1. Continue using **Upload** until you have specified all your files/folders.
105+
1. Check the box **Overwrite if already exists** if you wish. Verify the list of files/folders.
106+
1. Select **Next**.
107+
1. Confirm the details. Select **Back** to modify the settings or **Create** to create the dataset.
108+
1. Now select the data asset you just created.
109+
108110

109111

110112
## Configure incremental refresh
-145 KB
Loading
45.8 KB
Loading

0 commit comments

Comments
 (0)