Skip to content

Commit e2c9228

Browse files
authored
Merge pull request #199438 from ssalgadodev/runsToJobsPart7
Runs to Jobs
2 parents 499002e + 432ced1 commit e2c9228

9 files changed

+57
-55
lines changed

articles/machine-learning/how-to-move-workspace.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -26,7 +26,7 @@ Moving the workspace enables you to migrate the workspace and its contents as a
2626
| Workspace contents | Moved with workspace |
2727
| ----- |:-----:|
2828
| Datasets | Yes |
29-
| Experiment runs | Yes |
29+
| Experiment jobs | Yes |
3030
| Environments | Yes |
3131
| Models and other assets stored in the workspace | Yes |
3232
| Compute resources | No |
@@ -55,7 +55,7 @@ Moving the workspace enables you to migrate the workspace and its contents as a
5555

5656
| Resource provider | Why it's needed |
5757
| ----- | ----- |
58-
| __Microsoft.DocumentDB/databaseAccounts__ | Azure CosmosDB instance that logs metadata for the workspace. |
58+
| __Microsoft.DocumentDB/databaseAccounts__ | Azure Cosmos DB instance that logs metadata for the workspace. |
5959
| __Microsoft.Search/searchServices__ | Azure Search provides indexing capabilities for the workspace. |
6060

6161
For information on registering resource providers, see [Resolve errors for resource provider registration](/azure/azure-resource-manager/templates/error-register-resource-provider).
@@ -69,7 +69,7 @@ Moving the workspace enables you to migrate the workspace and its contents as a
6969

7070
* Workspace move is not meant for replicating workspaces, or moving individual assets such as models or datasets from one workspace to another.
7171
* Workspace move doesn't support migration across Azure regions or Azure Active Directory tenants.
72-
* The workspace mustn't be in use during the move operation. Verify that all experiment runs, data profiling runs, and labeling projects have completed. Also verify that inference endpoints aren't being invoked.
72+
* The workspace mustn't be in use during the move operation. Verify that all experiment jobs, data profiling jobs, and labeling projects have completed. Also verify that inference endpoints aren't being invoked.
7373
* The workspace will become unavailable during the move.
7474
* Before to the move, you must delete or detach computes and inference endpoints from the workspace.
7575

@@ -81,7 +81,7 @@ Moving the workspace enables you to migrate the workspace and its contents as a
8181
az account set -s origin-sub-id
8282
```
8383
84-
2. Verify that the origin workspace isn't being used. Check that any experiment runs, data profiling runs, or labeling projects have completed. Also verify that inferencing endpoints aren't being invoked.
84+
2. Verify that the origin workspace isn't being used. Check that any experiment jobs, data profiling jobs, or labeling projects have completed. Also verify that inferencing endpoints aren't being invoked.
8585
8686
3. Delete or detach any computes from the workspace, and delete any inferencing endpoints. Moving computes and endpoints isn't supported. Also note that the workspace will become unavailable during the move.
8787

articles/machine-learning/how-to-responsible-ai-scorecard.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -22,7 +22,7 @@ Azure Machine Learning’s Responsible AI dashboard is designed for machine lear
2222
- While an end-to-end machine learning life cycle includes both technical and non-technical stakeholders in the loop, there's very little support to enable an effective multi-stakeholder alignment, helping technical experts get timely feedback and direction from the non-technical stakeholders.
2323
- AI regulations make it essential to be able to share model and data insights with auditors and risk officers for auditability purposes.
2424

25-
One of the biggest benefits of using the Azure Machine Learning ecosystem is related to the archival of model and data insights in the Azure Machine Learning Run History (for quick reference in future). As a part of that infrastructure and to accompany machine learning models and their corresponding Responsible AI dashboards, we introduce the Responsible AI scorecard, a customizable report that you can easily configure, download, and share with your technical and non-technical stakeholders to educate them about your data and model health and compliance and build trust. This scorecard could also be used in audit reviews to inform the stakeholders about the characteristics of your model.
25+
One of the biggest benefits of using the Azure Machine Learning ecosystem is related to the archival of model and data insights in the Azure Machine Learning Job History (for quick reference in future). As a part of that infrastructure and to accompany machine learning models and their corresponding Responsible AI dashboards, we introduce the Responsible AI scorecard, a customizable report that you can easily configure, download, and share with your technical and non-technical stakeholders to educate them about your data and model health and compliance and build trust. This scorecard could also be used in audit reviews to inform the stakeholders about the characteristics of your model.
2626

2727
## Who should use a Responsible AI scorecard?
2828

articles/machine-learning/how-to-retrain-designer.md

Lines changed: 8 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -74,7 +74,7 @@ For this example, you will change the training data path from a fixed value to a
7474
> - After detaching, you can delete the pipeline parameter in the **Setings** pane.
7575
> - You can also add a pipeline parameter in the **Settings** pane, and then apply it on some component parameter.
7676
77-
1. Submit the pipeline run.
77+
1. Submit the pipeline job.
7878

7979
## Publish a training pipeline
8080

@@ -90,21 +90,21 @@ Publish a pipeline to a pipeline endpoint to easily reuse your pipelines in the
9090

9191
## Retrain your model
9292

93-
Now that you have a published training pipeline, you can use it to retrain your model on new data. You can submit runs from a pipeline endpoint from the studio workspace or programmatically.
93+
Now that you have a published training pipeline, you can use it to retrain your model on new data. You can submit jobs from a pipeline endpoint from the studio workspace or programmatically.
9494

95-
### Submit runs by using the studio portal
95+
### Submit jobs by using the studio portal
9696

97-
Use the following steps to submit a parameterized pipeline endpoint run from the studio portal:
97+
Use the following steps to submit a parameterized pipeline endpoint job from the studio portal:
9898

9999
1. Go to the **Endpoints** page in your studio workspace.
100100
1. Select the **Pipeline endpoints** tab. Then, select your pipeline endpoint.
101101
1. Select the **Published pipelines** tab. Then, select the pipeline version that you want to run.
102102
1. Select **Submit**.
103-
1. In the setup dialog box, you can specify the parameters values for the run. For this example, update the data path to train your model using a non-US dataset.
103+
1. In the setup dialog box, you can specify the parameters values for the job. For this example, update the data path to train your model using a non-US dataset.
104104

105-
![Screenshot that shows how to set up a parameterized pipeline run in the designer](./media/how-to-retrain-designer/published-pipeline-run.png)
105+
![Screenshot that shows how to set up a parameterized pipeline job in the designer](./media/how-to-retrain-designer/published-pipeline-run.png)
106106

107-
### Submit runs by using code
107+
### Submit jobs by using code
108108

109109
You can find the REST endpoint of a published pipeline in the overview panel. By calling the endpoint, you can retrain the published pipeline.
110110

@@ -116,4 +116,4 @@ In this article, you learned how to create a parameterized training pipeline end
116116

117117
For a complete walkthrough of how you can deploy a model to make predictions, see the [designer tutorial](tutorial-designer-automobile-price-train-score.md) to train and deploy a regression model.
118118

119-
For how to publish and submit a run to pipeline endpoint using SDK, see [this article](how-to-deploy-pipelines.md).
119+
For how to publish and submit a job to pipeline endpoint using SDK, see [this article](how-to-deploy-pipelines.md).

articles/machine-learning/how-to-run-batch-predictions-designer.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -101,9 +101,9 @@ Now you're ready to deploy the inference pipeline. This will deploy the pipeline
101101

102102
Now, you have a published pipeline with a dataset parameter. The pipeline will use the trained model created in the training pipeline to score the dataset you provide as a parameter.
103103

104-
### Submit a pipeline run
104+
### Submit a pipeline job
105105

106-
In this section, you'll set up a manual pipeline run and alter the pipeline parameter to score new data.
106+
In this section, you'll set up a manual pipeline job and alter the pipeline parameter to score new data.
107107

108108
1. After the deployment is complete, go to the **Endpoints** section.
109109

@@ -119,7 +119,7 @@ In this section, you'll set up a manual pipeline run and alter the pipeline para
119119

120120
1. Select the pipeline you published.
121121

122-
The pipeline details page shows you a detailed run history and connection string information for your pipeline.
122+
The pipeline details page shows you a detailed job history and connection string information for your pipeline.
123123

124124
1. Select **Submit** to create a manual run of the pipeline.
125125

@@ -133,7 +133,7 @@ In this section, you'll set up a manual pipeline run and alter the pipeline para
133133

134134
You can find information on how to consume pipeline endpoints and published pipeline in the **Endpoints** section.
135135

136-
You can find the REST endpoint of a pipeline endpoint in the run overview panel. By calling the endpoint, you're consuming its default published pipeline.
136+
You can find the REST endpoint of a pipeline endpoint in the job overview panel. By calling the endpoint, you're consuming its default published pipeline.
137137

138138
You can also consume a published pipeline in the **Published pipelines** page. Select a published pipeline and you can find the REST endpoint of it in the **Published pipeline overview** panel to the right of the graph.
139139

articles/machine-learning/how-to-save-write-experiment-files.md

Lines changed: 7 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -18,13 +18,13 @@ ms.date: 03/10/2020
1818

1919
In this article, you learn where to save input files, and where to write output files from your experiments to prevent storage limit errors and experiment latency.
2020

21-
When launching training runs on a [compute target](concept-compute-target.md), they are isolated from outside environments. The purpose of this design is to ensure reproducibility and portability of the experiment. If you run the same script twice, on the same or another compute target, you receive the same results. With this design, you can treat compute targets as stateless computation resources, each having no affinity to the jobs that are running after they are finished.
21+
When launching training jobs on a [compute target](concept-compute-target.md), they are isolated from outside environments. The purpose of this design is to ensure reproducibility and portability of the experiment. If you run the same script twice, on the same or another compute target, you receive the same results. With this design, you can treat compute targets as stateless computation resources, each having no affinity to the jobs that are running after they are finished.
2222

2323
## Where to save input files
2424

2525
Before you can initiate an experiment on a compute target or your local machine, you must ensure that the necessary files are available to that compute target, such as dependency files and data files your code needs to run.
2626

27-
Azure Machine Learning runs training scripts by copying the entire source directory. If you have sensitive data that you don't want to upload, use a [.ignore file](how-to-save-write-experiment-files.md#storage-limits-of-experiment-snapshots) or don't include it in the source directory. Instead, access your data using a [datastore](/python/api/azureml-core/azureml.data).
27+
Azure Machine Learning jobs training scripts by copying the entire source directory. If you have sensitive data that you don't want to upload, use a [.ignore file](how-to-save-write-experiment-files.md#storage-limits-of-experiment-snapshots) or don't include it in the source directory. Instead, access your data using a [datastore](/python/api/azureml-core/azureml.data).
2828

2929
The storage limit for experiment snapshots is 300 MB and/or 2000 files.
3030

@@ -38,7 +38,7 @@ For this reason, we recommend:
3838

3939
### Storage limits of experiment snapshots
4040

41-
For experiments, Azure Machine Learning automatically makes an experiment snapshot of your code based on the directory you suggest when you configure the run. This has a total limit of 300 MB and/or 2000 files. If you exceed this limit, you'll see the following error:
41+
For experiments, Azure Machine Learning automatically makes an experiment snapshot of your code based on the directory you suggest when you configure the job. This has a total limit of 300 MB and/or 2000 files. If you exceed this limit, you'll see the following error:
4242

4343
```Python
4444
While attempting to take snapshot of .
@@ -56,18 +56,18 @@ Jupyter notebooks| Create a `.amlignore` file or move your notebook into a new,
5656

5757
## Where to write files
5858

59-
Due to the isolation of training experiments, the changes to files that happen during runs are not necessarily persisted outside of your environment. If your script modifies the files local to compute, the changes are not persisted for your next experiment run, and they're not propagated back to the client machine automatically. Therefore, the changes made during the first experiment run don't and shouldn't affect those in the second.
59+
Due to the isolation of training experiments, the changes to files that happen during jobs are not necessarily persisted outside of your environment. If your script modifies the files local to compute, the changes are not persisted for your next experiment job, and they're not propagated back to the client machine automatically. Therefore, the changes made during the first experiment job don't and shouldn't affect those in the second.
6060

6161
When writing changes, we recommend writing files to storage via an Azure Machine Learning dataset with an [OutputFileDatasetConfig object](/python/api/azureml-core/azureml.data.output_dataset_config.outputfiledatasetconfig). See [how to create an OutputFileDatasetConfig](how-to-train-with-datasets.md#where-to-write-training-output).
6262

6363
Otherwise, write files to the `./outputs` and/or `./logs` folder.
6464

6565
>[!Important]
66-
> Two folders, *outputs* and *logs*, receive special treatment by Azure Machine Learning. During training, when you write files to`./outputs` and`./logs` folders, the files will automatically upload to your run history, so that you have access to them once your run is finished.
66+
> Two folders, *outputs* and *logs*, receive special treatment by Azure Machine Learning. During training, when you write files to`./outputs` and`./logs` folders, the files will automatically upload to your job history, so that you have access to them once your job is finished.
6767
68-
* **For output such as status messages or scoring results,** write files to the `./outputs` folder, so they are persisted as artifacts in run history. Be mindful of the number and size of files written to this folder, as latency may occur when the contents are uploaded to run history. If latency is a concern, writing files to a datastore is recommended.
68+
* **For output such as status messages or scoring results,** write files to the `./outputs` folder, so they are persisted as artifacts in job history. Be mindful of the number and size of files written to this folder, as latency may occur when the contents are uploaded to job history. If latency is a concern, writing files to a datastore is recommended.
6969

70-
* **To save written file as logs in run history,** write files to `./logs` folder. The logs are uploaded in real time, so this method is suitable for streaming live updates from a remote run.
70+
* **To save written file as logs in job history,** write files to `./logs` folder. The logs are uploaded in real time, so this method is suitable for streaming live updates from a remote job.
7171

7272
## Next steps
7373

0 commit comments

Comments
 (0)