Skip to content

Commit a261bf7

Browse files
committed
PRS GA upadates
1 parent 8dacda0 commit a261bf7

File tree

1 file changed

+5
-5
lines changed

1 file changed

+5
-5
lines changed

articles/machine-learning/how-to-use-parallel-run-step.md

Lines changed: 5 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -17,9 +17,9 @@ ms.custom: Build2019
1717
# Run batch inference on large amounts of data by using Azure Machine Learning
1818
[!INCLUDE [applies-to-skus](../../includes/aml-applies-to-basic-enterprise-sku.md)]
1919

20-
Learn how to process large amounts of data asynchronously and in parallel by using Azure Machine Learning. The ParallelRunStep is a high-performance and high-throughput way to generate inferences and processing data. It provides parallelism capabilities out of the box.
20+
Learn how to run batch inference on large amounts of data asynchronously and in parallel by using Azure Machine Learning. The ParallelRunStep is a high-performance and high-throughput way to generate inferences and processing data. It provides parallelism capabilities out of the box.
2121

22-
With ParallelRunStep, it's straightforward to scale offline inferences to large clusters of machines on terabytes of structured or unstructured data resulting in improved productivity and optimized cost.
22+
With ParallelRunStep, it's straightforward to scale offline inferences to large clusters of machines on terabytes of structured or unstructured data with improved productivity and optimized cost.
2323

2424
In this article, you learn the following tasks:
2525

@@ -265,7 +265,7 @@ Now you have everything you need: the data inputs, the model, the output and you
265265

266266
First, specify the dependencies for your script. This allows you to install pip packages as well as configure the environment. Please always include **azureml-core** and **azureml-dataprep[pandas, fuse]** packages.
267267

268-
If you use custom docker image, you should also have conda installed.
268+
If you use a custom docker image (user_managed_dependencies=True), you should also have conda installed.
269269

270270
```python
271271
from azureml.core.environment import Environment
@@ -327,9 +327,9 @@ Create the ParallelRunStep by using the script, environment configuration, and p
327327
- `name`: The name of the step, with the following naming restrictions: unique, 3-32 characters, and regex ^\[a-z\]([-a-z0-9]*[a-z0-9])?$.
328328
- `parallel_run_config`: A `ParallelRunConfig` object, as defined earlier.
329329
- `inputs`: One or more single-typed Azure Machine Learning datasets to be partitioned for parallel processing.
330-
- `side_inputs`: One or more reference data or datasets used as side inputs. No need to be partitioned.
330+
- `side_inputs`: One or more reference data or datasets used as side inputs without need to be partitioned.
331331
- `output`: A `PipelineData` object that corresponds to the output directory.
332-
- `arguments`: A list of arguments passed to the user script (optional).
332+
- `arguments`: A list of arguments passed to the user script. Use unknown_args to retrieve them in your entry script (optional).
333333
- `allow_reuse`: Whether the step should reuse previous results when run with the same settings/inputs. If this parameter is `False`, a new run will always be generated for this step during pipeline execution. (optional; the default value is `True`.)
334334

335335
```python

0 commit comments

Comments
 (0)