You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/machine-learning/how-to-create-text-labeling-projects.md
+23-21Lines changed: 23 additions & 21 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -8,7 +8,7 @@ ms.reviewer: sgilley
8
8
ms.service: machine-learning
9
9
ms.subservice: mldata
10
10
ms.topic: how-to
11
-
ms.date: 09/29/2022
11
+
ms.date: 02/03/2023
12
12
ms.custom: data4ml, ignite-fall-2021
13
13
---
14
14
@@ -72,39 +72,41 @@ In many cases, it's fine to just upload local files. But [Azure Storage Explorer
72
72
73
73
To create a dataset from data that you've already stored in Azure Blob storage:
74
74
75
-
1. Select **Create a dataset**> **From datastore**.
76
-
1. Assign a **Name** to your dataset.
75
+
1. Select **+ Create** .
76
+
1. Assign a **Name** to your dataset, and optionally a description.
77
77
1. Choose the **Dataset type**:
78
78
* Select **Tabular** if you're using a .csv or .tsv file, where each row contains a response.
79
79
* Select **File** if you're using separate .txt files for each response.
80
-
1.(Optional) Provide a description for your dataset.
81
-
1. Select **Next**.
82
-
1. Select the datastore.
80
+
1.Select **Next**
81
+
1. Select **From Azure storage**, then **Next**.
82
+
1. Select the datastore, then select **Next**.
83
83
1. If your data is in a subfolder within your blob storage, choose **Browse** to select the path.
84
84
* Append "/**" to the path to include all the files in subfolders of the selected path.
85
85
* Append "**/*.*" to include all the data in the current container and its subfolders.
86
-
1. Select **Next**.
87
-
1.Confirm the details. Select **Back** to modify the settings or **Create** to create the dataset.
86
+
1. Select **Create**.
87
+
1.Now select the data asset you just created.
88
88
89
89
### Create a dataset from uploaded data
90
90
91
91
To directly upload your data:
92
92
93
-
1. Select **Create a dataset** > **From local files**.
94
-
1. Assign a **Name** to your dataset.
95
-
1. Choose the **Dataset type**.
96
-
* Select **Tabular** if you're using a .csv or .tsv file, where each row is a response.
93
+
1. Select **+ Create**.
94
+
1. Assign a **Name** to your dataset, and optionally a description.
95
+
1. Choose the **Dataset type**:
96
+
* Select **Tabular** if you're using a .csv or .tsv file, where each row contains a response.
97
97
* Select **File** if you're using separate .txt files for each response.
98
-
1. (Optional) Provide a description of your dataset.
99
-
1. Select **Next**
100
-
1. (Optional) Select or create a datastore. Or keep the default to upload to the default blob store ("workspaceblobstore") of your Machine Learning workspace.
101
-
1. Select **Upload** to select the local file(s) or folder(s) to upload.
102
98
1. Select **Next**.
103
-
1. If uploading .csv or .tsv files:
104
-
* Confirm the settings and preview, select **Next**.
105
-
* Include all columns of text you'd like the labeler to see when classifying that row. If you'll be using ML assisted labeling, adding numeric columns may degrade the ML assist model.
106
-
* Select **Next**.
107
-
1. Confirm the details. Select **Back** to modify the settings or **Create** to create the dataset.
99
+
1. Select **From local files**, then select **Next**.
100
+
1. (Optional) Select a datastore. Or keep the default to upload to the default blob store ("workspaceblobstore") of your Machine Learning workspace.
101
+
1. Select **Next**.
102
+
1. Select **Upload > Upload files** or **Upload > Upload folder** to select the local files or folder(s) to upload.
103
+
1. In the browser window, find your files or folder, then select **Open**.
104
+
1. Continue using **Upload** until you have specified all your files/folders.
105
+
1. Check the box **Overwrite if already exists** if you wish. Verify the list of files/folders.
106
+
1. Select **Next**.
107
+
1. Confirm the details. Select **Back** to modify the settings or **Create** to create the dataset.
0 commit comments