You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Based on the task type, you can create automl image jobs using task specific `automl` functions.
85
91
86
92
For example:
@@ -189,7 +195,7 @@ Once your data is in JSONL format, you can create training and validation `MLTab
189
195
190
196
Automated ML doesn't impose any constraints on training or validation data size for computer vision tasks. Maximum dataset size is only limited by the storage layer behind the dataset (i.e. blob store). There's no minimum number of images or labels. However, we recommend starting with a minimum of 10-15 samples per label to ensure the output model is sufficiently trained. The higher the total number of labels/classes, the more samples you need per label.
@@ -286,7 +296,7 @@ In general, deep learning model performance can often improve with more data. Da
286
296
287
297
Before doing a large sweep to search for the optimal models and hyperparameters, we recommend trying the default values to get a first baseline. Next, you can explore multiple hyperparameters for the same model before sweeping over multiple models and their parameters. This way, you can employ a more iterative approach, because with multiple models and multiple hyperparameters for each, the search space grows exponentially and you need more iterations to find optimal configurations.
If you wish to use the default hyperparameter values for a given algorithm (say yolov5), you can specify it using model_name parameter in set_image_model method of the task specific `automl` job. For example,
302
314
@@ -318,7 +330,7 @@ The primary metric used for model optimization and hyperparameter tuning depends
318
330
### Experiment budget
319
331
320
332
You can optionally specify the maximum time budget for your AutoML Vision training job using the `timeout` parameter in the `limits`- the amount of time in minutes before the experiment terminates. If none specified, default experiment timeout is seven days (maximum 60 days). For example,
Review detailed code examples and use cases in the [azureml-examples repository for automated machine learning samples](https://github.com/Azure/azureml-examples/tree/sdk-preview/cli/jobs/automl-standalone-jobs).
Review detailed code examples and use cases in the [GitHub notebook repository for automated machine learning samples](https://github.com/Azure/azureml-examples/tree/sdk-preview/sdk/jobs/automl-standalone-jobs).
@@ -45,7 +45,9 @@ You can seamlessly integrate with the [Azure Machine Learning data labeling](how
45
45
46
46
* This article assumes some familiarity with setting up an automated machine learning experiment. Follow the [how-to](how-to-configure-auto-train.md) to see the main automated machine learning experiment design patterns.
@@ -180,15 +182,17 @@ However, there are key differences:
180
182
* If you enable long range text, then a GPU with higher memory is required such as, [NCv3](../virtual-machines/ncv3-series.md) series or [ND](../virtual-machines/nd-series.md) series.
181
183
* The `enable_long_range_text` parameter is only available for multi-class classification tasks.
Multi-class text classification|`"eng"` <br> `"deu"` <br> `"mul"`| English BERT [cased](https://huggingface.co/bert-base-cased)<br> [Multilingual BERT](https://huggingface.co/bert-base-multilingual-cased) <br><br>For all other languages, automated ML applies multilingual BERT
229
233
Named entity recognition (NER)|`"eng"` <br> `"deu"` <br> `"mul"`| English BERT [cased](https://huggingface.co/bert-base-cased) <br> [German BERT](https://huggingface.co/bert-base-german-cased)<br> [Multilingual BERT](https://huggingface.co/bert-base-multilingual-cased) <br><br>For all other languages, automated ML applies multilingual BERT
See the following sample YAML files for each NLP task.
311
324
312
325
*[Multi-class text classification](https://github.com/Azure/azureml-examples/blob/main/cli/jobs/automl-standalone-jobs/cli-automl-text-classification-newsgroup/cli-automl-text-classification-newsgroup.yml)
313
326
*[Multi-label text classification](https://github.com/Azure/azureml-examples/blob/main/cli/jobs/automl-standalone-jobs/cli-automl-text-classification-multilabel-paper-cat/cli-automl-text-classification-multilabel-paper-cat.yml)
@@ -50,7 +50,7 @@ If you already have a data labeling project and you want to use that data, you c
50
50
### Using pre-labeled training data
51
51
If you have previously labeled data that you would like to use to train your model, you will first need to upload the images to the default Azure Blob Storage of your Azure ML Workspace and register it as a data asset.
Create a .yml file with the following configuration.
@@ -69,7 +69,10 @@ To upload the images as a data asset, you run the following CLI v2 command with
69
69
az ml data create -f [PATH_TO_YML_FILE] --workspace-name [YOUR_AZURE_WORKSPACE] --resource-group [YOUR_AZURE_RESOURCE_GROUP] --subscription [YOUR_AZURE_SUBSCRIPTION]
0 commit comments