Skip to content

Commit b308570

Browse files
Merge pull request #270184 from ssalgadodev/patch-80
Update how-to-prepare-datasets-for-automl-images.md
2 parents 5710211 + d4ea645 commit b308570

File tree

1 file changed

+6
-6
lines changed

1 file changed

+6
-6
lines changed

articles/machine-learning/how-to-prepare-datasets-for-automl-images.md

Lines changed: 6 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -9,7 +9,7 @@ ms.subservice: automl
99
ms.topic: how-to
1010
ms.custom: template-how-to, update-code, sdkv2,
1111
ms.reviewer: ssalgado
12-
ms.date: 05/26/2022
12+
ms.date: 03/26/2024
1313
---
1414

1515
# Prepare data for computer vision tasks with automated machine learning
@@ -43,7 +43,7 @@ It helps to create, manage, and monitor data labeling tasks for
4343
+ Object detection (bounding box)
4444
+ Instance segmentation (polygon)
4545

46-
If you already have a data labeling project and you want to use that data, you can [export your labeled data as an Azure Machine Learning Dataset](how-to-manage-labeling-projects.md#export-the-labels) and then access the dataset under 'Datasets' tab in Azure Machine Learning studio. This exported dataset can then be passed as an input using `azureml:<tabulardataset_name>:<version>` format. Here's an example of how to pass existing dataset as input for training computer vision models.
46+
If you already have labeled data you want to use, you can [export your labeled data as an Azure Machine Learning Dataset](how-to-manage-labeling-projects.md#export-the-labels) and then access the dataset under 'Datasets' tab in Azure Machine Learning studio. This exported dataset can then be passed as an input using `azureml:<tabulardataset_name>:<version>` format. Here's an example of how to pass existing dataset as input for training computer vision models.
4747

4848
# [Azure CLI](#tab/cli)
4949

@@ -78,7 +78,7 @@ Refer to CLI/SDK tabs for reference.
7878
---
7979

8080
### Using prelabeled training data from local machine
81-
If you have previously labeled data that you would like to use to train your model, you'll first need to upload the images to the default Azure Blob Storage of your Azure Machine Learning Workspace and register it as a [data asset](how-to-create-data-assets.md).
81+
If you have labeled data that you would like to use to train your model, you need to upload the images to Azure. You can upload the your images to the default Azure Blob Storage of your Azure Machine Learning Workspace and register it as a [data asset](how-to-create-data-assets.md).
8282

8383
The following script uploads the image data on your local machine at path "./data/odFridgeObjects" to datastore in Azure Blob Storage. It then creates a new data asset with the name "fridge-items-images-object-detection" in your Azure Machine Learning Workspace.
8484

@@ -149,11 +149,11 @@ my_data = Data(
149149

150150
---
151151

152-
Next, you'll need to get the label annotations in JSONL format. The schema of labeled data depends on the computer vision task at hand. Refer to [schemas for JSONL files for AutoML computer vision experiments](reference-automl-images-schema.md) to learn more about the required JSONL schema for each task type.
152+
Next, you need to get the label annotations in JSONL format. The schema of labeled data depends on the computer vision task at hand. Refer to [schemas for JSONL files for AutoML computer vision experiments](reference-automl-images-schema.md) to learn more about the required JSONL schema for each task type.
153153

154154
If your training data is in a different format (like, pascal VOC or COCO), [helper scripts](https://github.com/Azure/azureml-examples/blob/v1-archive/v1/python-sdk/tutorials/automl-with-azureml/image-object-detection/coco2jsonl.py) to convert the data to JSONL are available in [notebook examples](https://github.com/Azure/azureml-examples/blob/main/sdk/python/jobs/automl-standalone-jobs).
155155

156-
Once you have created jsonl file following the above steps, you can register it as a data asset using UI. Make sure you select `stream` type in schema section as shown below.
156+
Once you created jsonl file following the above steps, you can register it as a data asset using UI. Make sure you select `stream` type in schema section as shown in this animation.
157157

158158
![Animation showing how to register a data asset from the jsonl files](media\how-to-prepare-datasets-for-automl-images\ui-dataset-jsnol.gif)
159159

@@ -162,7 +162,7 @@ If you have your labeled training data present in a container in Azure Blob stor
162162

163163
## Create MLTable
164164

165-
Once you have your labeled data in JSONL format, you can use it to create `MLTable` as shown below. MLtable packages your data into a consumable object for training.
165+
Once you have your labeled data in JSONL format, you can use it to create `MLTable` as shown in this yaml snippet. MLtable packages your data into a consumable object for training.
166166

167167
```yaml
168168
paths:

0 commit comments

Comments
 (0)