Skip to content

Commit ea944be

Browse files
authored
Update how-to-prepare-datasets-for-automl-images.md
1 parent b588a96 commit ea944be

File tree

1 file changed

+6
-6
lines changed

1 file changed

+6
-6
lines changed

articles/machine-learning/v1/how-to-prepare-datasets-for-automl-images.md

Lines changed: 6 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@ ms.service: machine-learning
88
ms.subservice: automl
99
ms.topic: how-to
1010
ms.custom: UpdateFrequency5, template-how-to, sdkv1
11-
ms.date: 10/13/2021
11+
ms.date: 04/01/2024
1212
---
1313

1414
# Prepare data for computer vision tasks with automated machine learning v1
@@ -23,7 +23,7 @@ ms.date: 10/13/2021
2323
2424
In this article, you learn how to prepare image data for training computer vision models with [automated machine learning in Azure Machine Learning](../concept-automated-ml.md).
2525

26-
To generate models for computer vision tasks with automated machine learning, you need to bring labeled image data as input for model training in the form of an [Azure Machine Learning TabularDataset](/python/api/azureml-core/azureml.data.tabulardataset).
26+
To generate models for computer vision tasks with AutoML, you need to bring labeled image data as input for model training in the form of an [Azure Machine Learning TabularDataset](/python/api/azureml-core/azureml.data.tabulardataset).
2727

2828
To ensure your TabularDataset contains the accepted schema for consumption in automated ML, you can use the Azure Machine Learning data labeling tool or use a conversion script.
2929

@@ -49,22 +49,22 @@ If you already have a data labeling project and you want to use that data, you c
4949

5050
If you have labeled data in popular computer vision data formats, like VOC or COCO, [helper scripts](https://github.com/Azure/azureml-examples/blob/v1-archive/v1/python-sdk/tutorials/automl-with-azureml/image-object-detection/coco2jsonl.py) to generate JSONL files for training and validation data are available in [notebook examples](https://github.com/Azure/azureml-examples/tree/v1-archive/v1/python-sdk/tutorials/automl-with-azureml).
5151

52-
If your data doesn't follow any of the previously mentioned formats, you can use your own script to generate JSON Lines files based on schemas defined in [Schema for JSONL files for AutoML image experiments](../reference-automl-images-schema.md).
52+
If your data doesn't follow any of the previously mentioned formats, you can use your own script to generate JSON Lines files. To generate JSON Lines files, use schemas defined in [Schema for JSONL files for AutoML image experiments](../reference-automl-images-schema.md).
5353

54-
After your data file(s) are converted to the accepted JSONL format, you can upload them to your storage account on Azure.
54+
After your data files are converted to the accepted JSONL format, you can upload them to your storage account on Azure.
5555

5656
## Upload the JSONL file and images to storage
5757

5858
To use the data for automated ML training, upload the data to your [Azure Machine Learning workspace](../concept-workspace.md) via a [datastore](../how-to-access-data.md). The datastore provides a mechanism for you to upload/download data to storage on Azure, and interact with it from your remote compute targets.
5959

60-
Upload the entire parent directory consisting of images and JSONL files to the default datastore that is automatically created upon workspace creation. This datastore connects to the default Azure blob storage container that was created as part of workspace creation.
60+
Upload the entire parent directory consisting of images and JSONL files to the default datastore that is automatically created upon workspace creation. This datastore connects to the default Azure blob storage container that was created as part of workspace creation.
6161

6262
```python
6363
# Retrieve default datastore that's automatically created when we setup a workspace
6464
ds = ws.get_default_datastore()
6565
ds.upload(src_dir='./fridgeObjects', target_path='fridgeObjects')
6666
```
67-
Once the data upload is done, you can create an [Azure Machine Learning TabularDataset](/python/api/azureml-core/azureml.data.tabulardataset) and register it to your workspace for future use as input to your automated ML experiments for computer vision models.
67+
Once the data upload is done, you can create an [Azure Machine Learning TabularDataset.](/python/api/azureml-core/azureml.data.tabulardataset) Then, register the dataset to your workspace for future use as input to your automated ML experiments for computer vision models.
6868

6969
```python
7070
from azureml.core import Dataset

0 commit comments

Comments
 (0)