Commit cac5903

Merge pull request #227050 from santiagxf/santiagxf/azureml-batch-outputs
Update how-to-use-batch-endpoint.md
2 parents: 79964cc + 5a857d9

1 file changed: +28 −11 lines

articles/machine-learning/how-to-use-batch-endpoint.md

Lines changed: 28 additions & 11 deletions
@@ -493,7 +493,7 @@ Invoking a batch endpoint triggers a batch scoring job. A job `name` will be ret
 ```python
 job = ml_client.batch_endpoints.invoke(
     endpoint_name=endpoint_name,
-    inputs=Input(path="https://pipelinedata.blob.core.windows.net/sampledata/mnist", type=AssetTypes.URI_FOLDER)
+    inputs=Input(path="https://azuremlexampledata.blob.core.windows.net/data/mnist/sample/", type=AssetTypes.URI_FOLDER)
 )
 ```

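The updated snippet points the batch input at a public blob folder and tags it as `AssetTypes.URI_FOLDER`, where a trailing slash marks the path as a folder. As a minimal illustration (a hypothetical helper, not part of the Azure ML SDK or this doc), such folder URLs can be normalized before building the `Input`:

```python
# Hypothetical helper (assumption: not part of azure-ai-ml): normalize a blob
# folder URL so it always carries exactly one trailing slash, matching the
# folder-style path used with AssetTypes.URI_FOLDER in the snippet above.
def normalize_folder_url(url: str) -> str:
    """Strip surrounding whitespace and guarantee a single trailing slash."""
    url = url.strip()
    return url if url.endswith("/") else url + "/"

print(normalize_folder_url("https://azuremlexampledata.blob.core.windows.net/data/mnist/sample"))
# → https://azuremlexampledata.blob.core.windows.net/data/mnist/sample/
```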
@@ -511,7 +511,7 @@ job = ml_client.batch_endpoints.invoke(
     :::image type="content" source="./media/how-to-use-batch-endpoints-studio/job-setting-batch-scoring.png" alt-text="Screenshot of using the deployment to submit a batch job.":::

 1. Select __Next__.
-1. On __Select data source__, select the data input you want to use. For this example, select __Datastore__ and in the section __Path__ enter the full URL `https://pipelinedata.blob.core.windows.net/sampledata/mnist`. Notice that this only works because the given path has public access enabled. In general, you'll need to register the data source as a __Datastore__. See [Accessing data from batch endpoints jobs](how-to-access-data-batch-endpoints-jobs.md) for details.
+1. On __Select data source__, select the data input you want to use. For this example, select __Datastore__ and in the section __Path__ enter the full URL `https://azuremlexampledata.blob.core.windows.net/data/mnist/sample`. Notice that this only works because the given path has public access enabled. In general, you'll need to register the data source as a __Datastore__. See [Accessing data from batch endpoints jobs](how-to-access-data-batch-endpoints-jobs.md) for details.

     :::image type="content" source="./media/how-to-use-batch-endpoints-studio/select-datastore-job.png" alt-text="Screenshot of selecting datastore as an input option.":::
@@ -541,38 +541,53 @@ Use `output-path` to configure any folder in an Azure Machine Learning registere

 # [Python](#tab/python)

-Use `output_path` to configure any folder in an Azure Machine Learning registered datastore. The syntax for the `--output-path` is the same as `--input` when you're specifying a folder, that is, `azureml://datastores/<datastore-name>/paths/<path-on-datastore>/`. Use `output_file_name=<your-file-name>` to configure a new output file name.
+Use `params_override` to configure any folder in an Azure Machine Learning registered data store. Only registered data stores are supported as output paths. In this example, we use the default data store:
+
+```python
+batch_ds = ml_client.datastores.get_default()
+```
+
+Once you've identified the data store you want to use, configure the output as follows:

 ```python
 job = ml_client.batch_endpoints.invoke(
     endpoint_name=endpoint_name,
-    inputs={
-        "input": Input(path="https://pipelinedata.blob.core.windows.net/sampledata/mnist", type=AssetTypes.URI_FOLDER)
-    },
-    output_path={
-        "score": Input(path=f"azureml://datastores/workspaceblobstore/paths/{endpoint_name}")
-    },
-    output_file_name="predictions.csv"
+    input=Input(
+        path="https://azuremlexampledata.blob.core.windows.net/data/mnist/sample/",
+        type=AssetTypes.URI_FOLDER
+    ),
+    params_override=[
+        { "output_dataset.datastore_id": f"azureml:{batch_ds.id}" },
+        { "output_dataset.path": "/mnist-batch-results" },
+        { "output_file_name": "mnist-predictions.csv" },
+    ]
 )
 ```

 # [Studio](#tab/azure-studio)

 1. Navigate to the __Endpoints__ tab on the side menu.
+
 1. Select the tab __Batch endpoints__.
+
 1. Select the batch endpoint you just created.
+
 1. Select __Create job__.

     :::image type="content" source="./media/how-to-use-batch-endpoints-studio/create-batch-job.png" alt-text="Screenshot of the create job option to start batch scoring.":::

 1. On __Deployment__, select the deployment you want to execute.
+
 1. Select __Next__.
+
 1. Check the option __Override deployment settings__.

     :::image type="content" source="./media/how-to-use-batch-endpoints-studio/overwrite-setting.png" alt-text="Screenshot of the overwrite setting when starting a batch job.":::

 1. You can now configure __Output file name__ and some extra properties of the deployment execution. Just this execution will be affected.
+
 1. On __Select data source__, select the data input you want to use.
+
 1. On __Configure output location__, check the option __Enable output configuration__.

     :::image type="content" source="./media/how-to-use-batch-endpoints-studio/configure-output-location.png" alt-text="Screenshot of optionally configuring output location.":::
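The new Python snippet in this hunk supplies the output datastore, path, and file name as a list of single-key dictionaries in `params_override`. As a sketch only (a hypothetical convenience wrapper, not part of the azure-ai-ml SDK or this doc), that list can be assembled from plain arguments so the three settings stay in one place:

```python
# Hypothetical helper (assumption: not an SDK function): build the
# `params_override` list used in the diff from plain arguments.
def build_output_overrides(datastore_id: str, output_path: str, file_name: str) -> list:
    """Return the override list expected by ml_client.batch_endpoints.invoke."""
    return [
        {"output_dataset.datastore_id": f"azureml:{datastore_id}"},
        {"output_dataset.path": output_path},
        {"output_file_name": file_name},
    ]

overrides = build_output_overrides("my-datastore-id", "/mnist-batch-results", "mnist-predictions.csv")
```

The wrapper would then be passed as `params_override=overrides` in the `invoke` call shown above; `"my-datastore-id"` is a placeholder for the datastore's real resource id.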
@@ -604,7 +619,9 @@ Some settings can be overwritten when invoke to make best use of the compute res
 ```python
 job = ml_client.batch_endpoints.invoke(
     endpoint_name=endpoint_name,
-    input=Input(path="https://pipelinedata.blob.core.windows.net/sampledata/mnist"),
+    input=Input(
+        path="https://azuremlexampledata.blob.core.windows.net/data/mnist/sample/"
+    ),
     params_override=[
         { "mini_batch_size": "20" },
         { "compute.instance_count": "5" }

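The overrides in this last hunk (`mini_batch_size=20`, `compute.instance_count=5`) control how work is split across the cluster. A back-of-the-envelope sketch, assuming each input file counts as one item (the item counts here are illustrative, not from the doc):

```python
import math

# Rough sizing sketch (assumption: one file per item) for the overridden
# settings in the diff: mini_batch_size and compute.instance_count.
def mini_batches(total_files: int, mini_batch_size: int) -> int:
    """Number of mini-batches the scoring job will produce."""
    return math.ceil(total_files / mini_batch_size)

def batches_per_instance(total_files: int, mini_batch_size: int, instance_count: int) -> int:
    """Upper bound on mini-batches any single compute instance processes."""
    return math.ceil(mini_batches(total_files, mini_batch_size) / instance_count)

# With the diff's values (mini_batch_size=20, compute.instance_count=5)
# and a hypothetical 1000 input files:
print(mini_batches(1000, 20))             # → 50 mini-batches
print(batches_per_instance(1000, 20, 5))  # → 10 mini-batches per instance, at most
```

Larger `mini_batch_size` means fewer, heavier mini-batches; more instances spread those mini-batches wider, which is why the two settings are tuned together.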