Skip to content

Commit ea11b64

Browse files
committed
acrolinx changes
1 parent fdd60a9 commit ea11b64

File tree

1 file changed

+12
-12
lines changed

1 file changed

+12
-12
lines changed

articles/machine-learning/how-to-prepare-datasets-for-automl-images.md

Lines changed: 12 additions & 12 deletions
Original file line numberDiff line numberDiff line change
@@ -32,18 +32,18 @@ If your labeled training data is in a different format (like, pascal VOC or COCO
3232
* Familiarize yourself with the accepted [schemas for JSONL files for AutoML computer vision experiments](reference-automl-images-schema.md).
3333

3434
## Get labeled data
35-
In order to train computer vision models using AutoML, you need to first get labeled training data. The images need to be uploaded to the cloud and label annotations need to be in JSONL format. You can either use the Azure Machine Learning Data Labeling tool to label your data or you could start with pre-labeled image data.
35+
In order to train computer vision models using AutoML, you need to first get labeled training data. The images need to be uploaded to the cloud and label annotations need to be in JSONL format. You can either use the Azure Machine Learning Data Labeling tool to label your data or you could start with prelabeled image data.
3636

3737
### Using Azure Machine Learning Data Labeling tool to label your training data
38-
If you don't have pre-labeled data, you can use Azure Machine Learning's [data labeling tool](how-to-create-image-labeling-projects.md) to manually label images. This tool automatically generates the data required for training in the accepted format.
38+
If you don't have prelabeled data, you can use Azure Machine Learning's [data labeling tool](how-to-create-image-labeling-projects.md) to manually label images. This tool automatically generates the data required for training in the accepted format.
3939

4040
It helps to create, manage, and monitor data labeling tasks for
4141

4242
+ Image classification (multi-class and multi-label)
4343
+ Object detection (bounding box)
4444
+ Instance segmentation (polygon)
4545

46-
If you already have a data labeling project and you want to use that data, you can [export your labeled data as an Azure Machine Learning Dataset](how-to-create-image-labeling-projects.md#export-the-labels) and then access the dataset under 'Datasets' tab in Azure Machine Learning Studio. This exported dataset can then be passed as an input using `azureml:<tabulardataset_name>:<version>` format. Here is an example on how to pass existing dataset as input for training computer vision models.
46+
If you already have a data labeling project and you want to use that data, you can [export your labeled data as an Azure Machine Learning Dataset](how-to-create-image-labeling-projects.md#export-the-labels) and then access the dataset under 'Datasets' tab in Azure Machine Learning studio. This exported dataset can then be passed as an input using `azureml:<tabulardataset_name>:<version>` format. Here's an example of how to pass existing dataset as input for training computer vision models.
4747

4848
# [Azure CLI](#tab/cli)
4949

@@ -73,21 +73,21 @@ my_training_data_input = Input(
7373

7474
# [Studio](#tab/Studio)
7575

76-
Please refer to Cli/Sdk tabs for reference.
76+
Refer to Cli/Sdk tabs for reference.
7777

7878
---
7979

80-
### Using pre-labeled training data from local machine
81-
If you have previously labeled data that you would like to use to train your model, you will first need to upload the images to the default Azure Blob Storage of your Azure Machine Learning Workspace and register it as a [data asset](how-to-create-data-assets.md).
80+
### Using prelabeled training data from local machine
81+
If you have previously labeled data that you would like to use to train your model, you'll first need to upload the images to the default Azure Blob Storage of your Azure Machine Learning Workspace and register it as a [data asset](how-to-create-data-assets.md).
8282

8383
The following script uploads the image data on your local machine at path "./data/odFridgeObjects" to datastore in Azure Blob Storage. It then creates a new data asset with the name "fridge-items-images-object-detection" in your Azure Machine Learning Workspace.
8484

85-
If there already exists a data asset with the name "fridge-items-images-object-detection" in your Azure Machine Learning Workspace, it will update the version number of the data asset and point it to the new location where the image data uploaded.
85+
If there already exists a data asset with the name "fridge-items-images-object-detection" in your Azure Machine Learning Workspace, it updates the version number of the data asset and point it to the new location where the image data uploaded.
8686

8787
# [Azure CLI](#tab/cli)
8888
[!INCLUDE [cli v2](includes/machine-learning-cli-v2.md)]
8989

90-
Create a .yml file with the following configuration.
90+
Create an .yml file with the following configuration.
9191

9292
```yml
9393
$schema: https://azuremlschemas.azureedge.net/latest/data.schema.json
@@ -97,7 +97,7 @@ path: ./data/odFridgeObjects
9797
type: uri_folder
9898
```
9999
100-
To upload the images as a data asset, you run the following CLI v2 command with the path to your .yml file, workspace name, resource group and subscription ID.
100+
To upload the images as a data asset, you run the following CLI v2 command with the path to your .yml file, workspace name, resource group, and subscription ID.
101101
102102
```azurecli
103103
az ml data create -f [PATH_TO_YML_FILE] --workspace-name [YOUR_AZURE_WORKSPACE] --resource-group [YOUR_AZURE_RESOURCE_GROUP] --subscription [YOUR_AZURE_SUBSCRIPTION]
@@ -120,7 +120,7 @@ If you already have your data present in an existing datastore and want to creat
120120
# [Azure CLI](#tab/cli)
121121
[!INCLUDE [cli v2](includes/machine-learning-cli-v2.md)]
122122

123-
Create a .yml file with the following configuration.
123+
Create an .yml file with the following configuration.
124124

125125
```yml
126126
$schema: https://azuremlschemas.azureedge.net/latest/data.schema.json
@@ -148,15 +148,15 @@ my_data = Data(
148148

149149
---
150150

151-
Next, you will need to get the label annotations in JSONL format. The schema of labeled data depends on the computer vision task at hand. Refer to [schemas for JSONL files for AutoML computer vision experiments](reference-automl-images-schema.md) to learn more about the required JSONL schema for each task type.
151+
Next, you'll need to get the label annotations in JSONL format. The schema of labeled data depends on the computer vision task at hand. Refer to [schemas for JSONL files for AutoML computer vision experiments](reference-automl-images-schema.md) to learn more about the required JSONL schema for each task type.
152152

153153
If your training data is in a different format (like, pascal VOC or COCO), [helper scripts](https://github.com/Azure/azureml-examples/blob/v1-archive/v1/python-sdk/tutorials/automl-with-azureml/image-object-detection/coco2jsonl.py) to convert the data to JSONL are available in [notebook examples](https://github.com/Azure/azureml-examples/blob/main/sdk/python/jobs/automl-standalone-jobs).
154154

155155
Once you have created jsonl file following the above steps, you can register it as a data asset using UI. Make sure you select `stream` type in schema section as shown below.
156156

157157
![Animation showing how to register a data asset from the jsonl files](media\how-to-prepare-datasets-for-automl-images\ui-dataset-jsnol.gif)
158158

159-
### Using pre-labeled training data from Azure Blob storage
159+
### Using prelabeled training data from Azure Blob storage
160160
If you have your labeled training data present in a container in Azure Blob storage, then you can access it directly from there by [creating a datastore referring to that container](how-to-datastore.md#create-an-azure-blob-datastore).
161161

162162
## Create MLTable

0 commit comments

Comments
 (0)