Skip to content

Commit 952d0f3

Browse files
committed
freshness pass
1 parent 8487990 commit 952d0f3

File tree

1 file changed

+5
-11
lines changed

1 file changed

+5
-11
lines changed

articles/machine-learning/v1/how-to-use-labeled-dataset.md

Lines changed: 5 additions & 11 deletions
Original file line numberDiff line numberDiff line change
@@ -9,13 +9,13 @@ ms.service: azure-machine-learning
99
ms.subservice: mldata
1010
ms.topic: how-to
1111
ms.custom: UpdateFrequency5, data4ml, sdkv1
12-
ms.date: 08/17/2022
12+
ms.date: 03/10/2025
1313
#Customer intent: As an experienced Python developer, I need to export my data labels and use them for machine learning tasks.
1414
---
1515

1616
# Create and explore Azure Machine Learning dataset with labels
1717

18-
In this article, you'll learn how to export the data labels from an Azure Machine Learning data labeling project and load them into popular formats such as, a pandas dataframe for data exploration.
18+
In this article, you learn how to export the data labels from an Azure Machine Learning data labeling project and load them into popular formats, such as a pandas dataframe for data exploration.
1919

2020
## What are datasets with labels
2121

@@ -41,7 +41,7 @@ Use the **Export** button on the **Project details** page of your labeling proje
4141
The COCO file is created in the default blob store of the Azure Machine Learning workspace in a folder within *export/coco*.
4242

4343
>[!NOTE]
44-
>In object detection projects, the exported "bbox": [x,y,width,height]" values in COCO file are normalized. They are scaled to 1. Example : a bounding box at (10, 10) location, with 30 pixels width , 60 pixels height, in a 640x480 pixel image will be annotated as (0.015625. 0.02083, 0.046875, 0.125). Since the coordintes are normalized, it will show as '0.0' as "width" and "height" for all images. The actual width and height can be obtained using Python library like OpenCV or Pillow(PIL).
44+
>In object detection projects, the exported `bbox: [x,y,width,height]` values in COCO file are normalized. They're scaled to 1. For example, a bounding box at (10, 10) location, with 30 pixels width, 60 pixels height, in a 640x480 pixel image is annotated as (0.015625. 0.02083, 0.046875, 0.125). Since the coordinates are normalized, it shows as '0.0' as "width" and "height" for all images. The actual width and height can be obtained using Python library like OpenCV or Pillow(PIL).
4545
4646
### Azure Machine Learning dataset
4747

@@ -50,11 +50,11 @@ You can access the exported Azure Machine Learning dataset in the **Datasets** s
5050
![Exported dataset](../media/how-to-create-labeling-projects/exported-dataset.png)
5151

5252
> [!TIP]
53-
> Once you have exported your labeled data to an Azure Machine Learning dataset, you can use AutoML to build computer vision models trained on your labeled data. Learn more at [Set up AutoML to train computer vision models with Python](../how-to-auto-train-image-models.md)
53+
> Once you export your labeled data to an Azure Machine Learning dataset, you can use AutoML to build computer vision models trained on your labeled data. Learn more at [Set up AutoML to train computer vision models with Python](../how-to-auto-train-image-models.md)
5454
5555
## Explore labeled datasets via pandas dataframe
5656

57-
Load your labeled datasets into a pandas dataframe to leverage popular open-source libraries for data exploration with the [`to_pandas_dataframe()`](/python/api/azureml-core/azureml.data.tabulardataset#to-pandas-dataframe-on-error--null---out-of-range-datetime--null--) method from the `azureml-dataprep` class.
57+
Load your labeled datasets into a pandas dataframe to use popular open-source libraries for data exploration with the [`to_pandas_dataframe()`](/python/api/azureml-core/azureml.data.tabulardataset#to-pandas-dataframe-on-error--null---out-of-range-datetime--null--) method from the `azureml-dataprep` class.
5858

5959
Install the class with the following shell command:
6060

@@ -75,12 +75,6 @@ from azureml.core import Dataset, Workspace
7575
animal_labels = Dataset.get_by_name(workspace, 'animal_labels')
7676
animal_pd = animal_labels.to_pandas_dataframe()
7777

78-
import matplotlib.pyplot as plt
79-
import matplotlib.image as mpimg
80-
81-
#read images from dataset
82-
img = mpimg.imread(animal_pd['image_url'].iloc(0).open())
83-
imgplot = plt.imshow(img)
8478
```
8579

8680
## Next steps

0 commit comments

Comments
 (0)