Skip to content

Commit 5daee44

Browse files
authored
Merge pull request #114127 from Blackmist/amlignore
Amlignore/Gitignore info
2 parents bdde697 + b7b5dd9 commit 5daee44

5 files changed

+14
-5
lines changed

articles/machine-learning/concept-azure-machine-learning-architecture.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -115,7 +115,7 @@ For example run configurations, see [Select and use a compute target to train yo
115115
When you submit a run, Azure Machine Learning compresses the directory that contains the script as a zip file and sends it to the compute target. The zip file is then extracted, and the script is run there. Azure Machine Learning also stores the zip file as a snapshot as part of the run record. Anyone with access to the workspace can browse a run record and download the snapshot.
116116

117117
> [!NOTE]
118-
> To prevent unnecessary files from being included in the snapshot, make an ignore file (.gitignore or .amlignore). Place this file in the Snapshot directory and add the filenames to ignore in it. The .amlignore file uses the same [syntax and patterns as the .gitignore file](https://git-scm.com/docs/gitignore). If both files exist, the .amlignore file takes precedence.
118+
> [!INCLUDE [amlinclude-info](../../includes/machine-learning-amlignore-gitignore.md)]
119119
120120
### GitHub tracking and integration
121121

articles/machine-learning/how-to-create-your-first-pipeline.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -325,7 +325,7 @@ pipeline1 = Pipeline(workspace=ws, steps=steps)
325325

326326
### Use a dataset
327327

328-
Datasets created from Azure Blob storage, Azure Files, Azure Data Lake Storage Gen1, Azure Data Lake Storage Gen2, Azure SQL Database, and Azure Database for PostgreSQL can be used as input to any pipeline step. You can write output to a [DataTransferStep](https://docs.microsoft.com/python/api/azureml-pipeline-steps/azureml.pipeline.steps.datatransferstep?view=azure-ml-py), [DatabricksStep](https://docs.microsoft.com/python/api/azureml-pipeline-steps/azureml.pipeline.steps.databricks_step.databricksstep?view=azure-ml-py), or if you want to write data to a specific datasore use [PipelineData](https://docs.microsoft.com/python/api/azureml-pipeline-core/azureml.pipeline.core.pipelinedata?view=azure-ml-py).
328+
Datasets created from Azure Blob storage, Azure Files, Azure Data Lake Storage Gen1, Azure Data Lake Storage Gen2, Azure SQL Database, and Azure Database for PostgreSQL can be used as input to any pipeline step. You can write output to a [DataTransferStep](https://docs.microsoft.com/python/api/azureml-pipeline-steps/azureml.pipeline.steps.datatransferstep?view=azure-ml-py), [DatabricksStep](https://docs.microsoft.com/python/api/azureml-pipeline-steps/azureml.pipeline.steps.databricks_step.databricksstep?view=azure-ml-py), or if you want to write data to a specific datastore use [PipelineData](https://docs.microsoft.com/python/api/azureml-pipeline-core/azureml.pipeline.core.pipelinedata?view=azure-ml-py).
329329

330330
> [!IMPORTANT]
331331
> Writing output data back to a datastore using PipelineData is only supported for Azure Blob and Azure File share datastores. This functionality is not supported for [ADLS Gen 2 datastores](https://docs.microsoft.com/python/api/azureml-core/azureml.data.azure_data_lake_datastore.azuredatalakegen2datastore?view=azure-ml-py) at this time.
@@ -365,7 +365,7 @@ For more detail, including alternate ways to pass and access data, see [Moving d
365365
When you submit the pipeline, Azure Machine Learning checks the dependencies for each step and uploads a snapshot of the source directory you specified. If no source directory is specified, the current local directory is uploaded. The snapshot is also stored as part of the experiment in your workspace.
366366

367367
> [!IMPORTANT]
368-
> To prevent files from being included in the snapshot, create a [.gitignore](https://git-scm.com/docs/gitignore) or `.amlignore` file in the directory and add the files to it. The `.amlignore` file uses the same syntax and patterns as the [.gitignore](https://git-scm.com/docs/gitignore) file. If both files exist, the `.amlignore` file takes precedence.
368+
> [!INCLUDE [amlinclude-info](../../includes/machine-learning-amlignore-gitignore.md)]
369369
>
370370
> For more information, see [Snapshots](concept-azure-machine-learning-architecture.md#snapshots).
371371

articles/machine-learning/how-to-save-write-experiment-files.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -49,7 +49,7 @@ To resolve this error, store your experiment files on a datastore. If you can't
4949
Experiment description|Storage limit solution
5050
---|---
5151
Less than 2000 files & can't use a datastore| Override snapshot size limit with <br> `azureml._restclient.snapshots_client.SNAPSHOT_MAX_SIZE_BYTES = 'insert_desired_size'`<br> This may take several minutes depending on the number and size of files.
52-
Must use specific script directory| Make a `.amlignore` file to exclude files from your experiment snapshot that are not part of the source code. Add the filenames to the `.amlignore` file and place it in the same directory as your training script. The `.amlignore` file uses the same [syntax and patterns](https://git-scm.com/docs/gitignore) as a `.gitignore` file.
52+
Must use specific script directory| [!INCLUDE [amlinclude-info](../../includes/machine-learning-amlignore-gitignore.md)]
5353
Pipeline|Use a different subdirectory for each step
5454
Jupyter notebooks| Create a `.amlignore` file or move your notebook into a new, empty, subdirectory and run your code again.
5555

articles/machine-learning/how-to-set-up-training-targets.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -360,7 +360,7 @@ After you create a run configuration, you use it to run your experiment. The co
360360
> [!IMPORTANT]
361361
> When you submit the training run, a snapshot of the directory that contains your training scripts is created and sent to the compute target. It is also stored as part of the experiment in your workspace. If you change files and submit the run again, only the changed files will be uploaded.
362362
>
363-
> To prevent files from being included in the snapshot, create a [.gitignore](https://git-scm.com/docs/gitignore) or `.amlignore` file in the directory and add the files to it. The `.amlignore` file uses the same syntax and patterns as the [.gitignore](https://git-scm.com/docs/gitignore) file. If both files exist, the `.amlignore` file takes precedence.
363+
> [!INCLUDE [amlinclude-info](../../includes/machine-learning-amlignore-gitignore.md)]
364364
>
365365
> For more information, see [Snapshots](concept-azure-machine-learning-architecture.md#snapshots).
366366
Lines changed: 9 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,9 @@
1+
---
2+
author: blackmist
3+
ms.service: machine-learning
4+
ms.topic: include
5+
ms.date: 05/06/2020
6+
ms.author: larryfr
7+
---
8+
9+
To prevent unnecessary files from being included in the snapshot, make an ignore file (`.gitignore` or `.amlignore`) in the directory. Add the files and directories to exclude to this file. For more information on the syntax to use inside this file, see [syntax and patterns](https://git-scm.com/docs/gitignore) for `.gitignore`. The `.amlignore` file uses the same syntax. _If both files exist, the `.amlignore` file takes precedence._

0 commit comments

Comments
 (0)