You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/machine-learning/how-to-deploy-model-custom-output.md
+7-7Lines changed: 7 additions & 7 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -29,7 +29,7 @@ Batch deployments allow you to take control of the output of the jobs by letting
29
29
30
30
## About this sample
31
31
32
-
This example shows how you can deploy a model to perform batch inference and customize how your predictions are written in the output. This example uses a model based on the [UCI Heart Disease dataset](https://archive.ics.uci.edu/ml/datasets/Heart+Disease). The database contains 76 attributes, but we use a subset of 14 of them. The model tries to predict the presence of heart disease in a patient. It's integer valued from 0 (no presence) to 1 (presence).
32
+
This example shows how you can deploy a model to perform batch inference and customize how your predictions are written in the output. The model is based on the [UCI Heart Disease dataset](https://archive.ics.uci.edu/ml/datasets/Heart+Disease). The database contains 76 attributes, but this example uses a subset of 14 of them. The model tries to predict the presence of heart disease in a patient. It's integer valued from 0 (no presence) to 1 (presence).
33
33
34
34
The model was trained using an `XGBBoost` classifier and all the required preprocessing was packaged as a `scikit-learn` pipeline, making this model an end-to-end pipeline that goes from raw data to predictions.
35
35
@@ -51,7 +51,7 @@ There's a Jupyter notebook that you can use to follow this example. In the clone
51
51
52
52
## Create a batch deployment with a custom output
53
53
54
-
In this example, we create a deployment that can write directly to the output folder of the batch deployment job. The deployment uses this feature to write custom parquet files.
54
+
In this example, you create a deployment that can write directly to the output folder of the batch deployment job. The deployment uses this feature to write custom parquet files.
55
55
56
56
### Register the model
57
57
@@ -86,7 +86,7 @@ __Remarks:__
86
86
* The `run` method returns a list of the processed files. It's required for the `run` function to return a `list` or a `pandas.DataFrame` object.
87
87
88
88
> [!WARNING]
89
-
> Take into account that all the batch executors have write access to this path at the same time. This means that you need to account for concurrency. In this case, we ensure that each executor writes its own file by using the input file name as the name of the output folder.
89
+
> Take into account that all the batch executors have write access to this path at the same time. This means that you need to account for concurrency. In this case, ensure that each executor writes its own file by using the input file name as the name of the output folder.
90
90
91
91
## Create the endpoint
92
92
@@ -96,13 +96,13 @@ You now create a batch endpoint named `heart-classifier-batch` where the model i
96
96
97
97
# [Azure CLI](#tab/cli)
98
98
99
-
In this case, let's place the name of the endpoint in a variable so we can easily reference it later.
99
+
In this case, place the name of the endpoint in a variable so you can easily reference it later.
@@ -179,7 +179,7 @@ Follow the next steps to create a deployment using the previous scoring script:
179
179
180
180
To test your endpoint, use a sample of unlabeled data located in this repository, which can be used with the model. Batch endpoints can only process data that's located in the cloud and is accessible from the Azure Machine Learning workspace. In this example, you upload it to an Azure Machine Learning data store. You're going to create a data asset that can be used to invoke the endpoint for scoring. However, notice that batch endpoints accept data that can be placed in multiple type of locations.
181
181
182
-
1.Let's invoke the endpoint with data from a storage account:
182
+
1.Invoke the endpoint with data from a storage account:
183
183
184
184
# [Azure CLI](#tab/cli)
185
185
@@ -213,7 +213,7 @@ To test your endpoint, use a sample of unlabeled data located in this repository
213
213
214
214
## Analyze the outputs
215
215
216
-
The job generates a named output called `score` where all the generated files are placed. Since we wrote into the directory directly, one file per each input file, then we can expect to have the same number of files. In this particular example, we decided to name the output files the same as the inputs, but they have a parquet extension.
216
+
The job generates a named output called `score` where all the generated files are placed. Since you wrote into the directory directly, one file per each input file, then you can expect to have the same number of files. In this particular example, name the output files the same as the inputs, but they have a parquet extension.
217
217
218
218
> [!NOTE]
219
219
> Notice that a file *predictions.csv* is also included in the output folder. This file contains the summary of the processed files.
0 commit comments