Skip to content

Commit 8c023b5

Browse files
authored
Merge pull request #99719 from MayMSFT/patch-26
Update how-to-train-with-datasets.md
2 parents 5d8d340 + 28349c4 commit 8c023b5

File tree

1 file changed

+3
-3
lines changed

1 file changed

+3
-3
lines changed

articles/machine-learning/service/how-to-train-with-datasets.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -102,10 +102,10 @@ experiment_run.wait_for_completion(show_output=True)
102102

103103
If you want to make your data files available on the compute target for training, use [FileDataset](https://docs.microsoft.com/python/api/azureml-core/azureml.data.file_dataset.filedataset?view=azure-ml-py) to mount or download files referred by it.
104104

105-
When you mount a file system, you attach that file system to a directory (mount point) and make it available on the compute target. Mounting is instantaneous because files are loaded only at the time of processing. Mounting is supported and recommended for Linux-based computes, including Azure Machine Learning Compute, virtual machines, and HDInsight. For non-Linux-based compute, only downloading is supported.
105+
### Mount v.s. Download
106+
When you mount a dataset, you attach the files referenced by the dataset to a directory (mount point) and make it available on the compute target. Mounting is supported for Linux-based computes, including Azure Machine Learning Compute, virtual machines, and HDInsight. If your data size exceeds the compute disk size, or you are only loading part of dataset in your script, mounting is recommended. Because downloading a dataset bigger than the disk size will fail, and mounting will only load the part of data used by your script at the time of processing.
107+
When you download a dataset, all the files referenced by the dataset will be downloaded to the compute target. Downloading is supported for all compute types. If your script process all files referenced by the dataset, and your compute disk can fit in your full dataset, downloading is recommended to avoid the overhead of streaming data from storage services.
106108

107-
>[!WARNING]
108-
> If your data size exceeds the storage limit of the compute target, downloading will fail.
109109

110110
### Create a FileDataset
111111

0 commit comments

Comments
 (0)