articles/machine-learning/how-to-use-labeled-dataset.md
Lines changed: 11 additions & 44 deletions
@@ -8,14 +8,14 @@ ms.service: machine-learning
 ms.subservice: mldata
 ms.topic: how-to
 ms.custom: data4ml
-ms.date: 10/21/2021
+ms.date: 02/15/2022

 # Customer intent: As an experienced Python developer, I need to export my data labels and use them for machine learning tasks.
 ---

 # Create and explore Azure Machine Learning dataset with labels

-In this article, you'll learn how to export the data labels from an Azure Machine Learning data labeling project and load them into popular formats such as, a pandas dataframe for data exploration or a Torchvision dataset for image transformation.
+In this article, you'll learn how to export the data labels from an Azure Machine Learning data labeling project and load them into popular formats, such as a pandas dataframe for data exploration.

 ## What are datasets with labels

@@ -25,7 +25,6 @@ We refer to Azure Machine Learning datasets with labels as labeled datasets. The

 * An Azure subscription. If you don’t have an Azure subscription, create a [free account](https://azure.microsoft.com/free/) before you begin.
 * The [Azure Machine Learning SDK for Python](/python/api/overview/azure/ml/intro), or access to [Azure Machine Learning studio](https://ml.azure.com/).
-* Install the [azure-contrib-dataset](/python/api/azureml-contrib-dataset/) package
 * A Machine Learning workspace. See [Create an Azure Machine Learning workspace](how-to-manage-workspace.md).
 * Access to an Azure Machine Learning data labeling project. If you don't have a labeling project, first create one for [image labeling](how-to-create-image-labeling-projects.md) or [text labeling](how-to-create-text-labeling-projects.md).

@@ -50,34 +49,31 @@ Once you have exported your labeled data to an Azure Machine Learning dataset, y

 ## Explore labeled datasets

-Load your labeled datasets into a pandas dataframe or Torchvision dataset to leverage popular open-source libraries for data exploration, as well as PyTorch provided libraries for image transformation and training.
+Load your labeled datasets into a pandas dataframe to leverage popular open-source libraries for data exploration.

 ### Pandas dataframe

-You can load labeled datasets into a pandas dataframe with the [`to_pandas_dataframe()`](/python/api/azureml-core/azureml.data.tabulardataset#to-pandas-dataframe-on-error--null---out-of-range-datetime--null--) method from the `azureml-contrib-dataset` class. Install the class with the following shell command:
+You can load labeled datasets into a pandas dataframe with the [`to_pandas_dataframe()`](/python/api/azureml-core/azureml.data.tabulardataset#to-pandas-dataframe-on-error--null---out-of-range-datetime--null--) method, which requires the `azureml-dataprep` package. Install the package with the following shell command:

 ```shell
-pip install azureml-contrib-dataset
+pip install azureml-dataprep
 ```
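
As a rough illustration of the kind of exploration a pandas dataframe makes possible, the following sketch counts how many rows carry each label. It is not part of the original article: the helper `label_counts`, the stand-in class `StubLabeledDataset`, and the column names `image_url` and `label` are assumptions made for the example, and it assumes one label per row (single-label classification). The only Azure Machine Learning call it relies on is `to_pandas_dataframe()`.

```python
import pandas as pd


def label_counts(dataset, label_column="label"):
    """Return the number of rows per label, most frequent first.

    `dataset` is any object exposing to_pandas_dataframe(), such as a
    TabularDataset. The default column name "label" is an assumption.
    """
    df = dataset.to_pandas_dataframe()
    return df[label_column].value_counts()


class StubLabeledDataset:
    """Hypothetical stand-in so the sketch runs without a workspace."""

    def __init__(self, rows):
        self._rows = rows

    def to_pandas_dataframe(self):
        return pd.DataFrame(self._rows)


# Made-up rows that imitate an exported image classification dataset.
stub = StubLabeledDataset(
    [
        {"image_url": "images/cat1.jpg", "label": "cat"},
        {"image_url": "images/cat2.jpg", "label": "cat"},
        {"image_url": "images/dog1.jpg", "label": "dog"},
    ]
)

counts = label_counts(stub)
print(counts)
```

With a real workspace, the stub would be replaced by a registered dataset, for example one retrieved with `Dataset.get_by_name(workspace, 'animal_labels')`.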

->[!NOTE]
->The azureml.contrib namespace changes frequently, as we work to improve the service. As such, anything in this namespace should be considered as a preview, and not fully supported by Microsoft.
-
-Azure Machine Learning offers the following file handling options for file streams when converting to a pandas dataframe.
-* Download: Download your data files to a local path.
-* Mount: Mount your data files to a mount point. Mount only works for Linux-based compute, including Azure Machine Learning notebook VM and Azure Machine Learning Compute.
+The exported dataset is a TabularDataset. If you plan to use the download() or mount() methods, be sure to set the parameter `stream_column='<image_url>'`.

 In the following code, the `animal_labels` dataset is the output from a labeling project previously saved to the workspace.

 ```Python
 import azureml.core
-import azureml.contrib.dataset
 from azureml.core import Dataset, Workspace
-from azureml.contrib.dataset import FileHandlingOption
-You can load labeled datasets into Torchvision dataset with the [to_torchvision()](/python/api/azureml-contrib-dataset/azureml.contrib.dataset.tabulardataset#to-torchvision--) method also from the `azureml-contrib-dataset` class. To use this method, you need to have [PyTorch](https://pytorch.org/) installed.
-
-In the following code, the `animal_labels` dataset is the output from a labeling project previously saved to the workspace.
-
-```python
-import azureml.core
-import azureml.contrib.dataset
-from azureml.core import Dataset, Workspace
-from azureml.contrib.dataset import FileHandlingOption
-
-from torchvision.transforms import functional as F