|
| 1 | +--- |
| 2 | +title: Azure Container for PyTorch |
| 3 | +titleSuffix: Azure Machine Learning |
| 4 | +description: Azure Container for PyTorch (ACPT), a curated environment that includes the best of Microsoft technologies for training with PyTorch on Azure. |
| 5 | +services: machine-learning |
| 6 | +author: sheetalarkadam |
| 7 | +ms.author: parinitarahi |
| 8 | +ms.reviewer: ssalgado |
| 9 | +ms.service: machine-learning |
| 10 | +ms.subservice: core |
| 11 | +ms.topic: reference |
| 12 | +ms.date: 10/21/2021 |
| 13 | +--- |
| 14 | + |
| 15 | + |
| 16 | + |
| 17 | +# Azure Container for PyTorch (ACPT) |
| 18 | + |
| 19 | + |
| 20 | + |
| 21 | +Azure Container for PyTorch is a lightweight, standalone environment that includes needed components to effectively run optimized training for large models on AzureML. The AzureML [curated environments](https://learn.microsoft.com/en-us/azure/machine-learning/resource-curated-environments) are available in the user’s workspace by default and are backed by cached Docker images that use the latest version of the AzureML SDK. It helps with reducing preparation costs and faster deployment time. ACPT can be used to quickly get started with various deep learning tasks with PyTorch on Azure. |
| 22 | + |
| 23 | + |
| 24 | + |
| 25 | +> [!NOTE] |
| 26 | +> Use the [Python SDK](how-to-use-environments.md), [CLI](/cli/azure/ml/environment#az-ml-environment-list), or Azure Machine Learning [studio](how-to-manage-environments-in-studio.md) to get the full list of environments and their dependencies. For more information, see the [environments article](how-to-use-environments.md#use-a-curated-environment). |
| 27 | +
|
| 28 | + |
| 29 | + |
| 30 | +## Why should I use ACPT? |
| 31 | + |
| 32 | + |
| 33 | + |
| 34 | +* As-IS use with pre-installed packages or build on top of the curated environment |
| 35 | +* Optimized Training framework to set up, develop, accelerate PyTorch model on large workloads. |
| 36 | +* Up-to-date stack with the latest compatible versions of Ubuntu, Python, PyTorch, CUDA\RocM, etc. |
| 37 | +* Ease of use: All components installed and validated against dozens of Microsoft workloads to reduce setup costs and accelerate time to value |
| 38 | +* Latest Training Optimization Technologies: [ONNX RunTime](https://onnxruntime.ai/) , [DeepSpeed](https://www.deepspeed.ai/), [MSCCL](https://github.com/microsoft/msccl), and others.. |
| 39 | +* Integration with Azure ML: Track your PyTorch experiments on ML Studio or using the AML SDK |
| 40 | +* The image is also available as a [DSVM](https://azure.microsoft.com/en-us/products/virtual-machines/data-science-virtual-machines/) |
| 41 | +* Azure Customer Support Reduces training and deployment latency. |
| 42 | +* Improves training and deployment success rate. |
| 43 | +* Avoid unnecessary image builds. |
| 44 | +* Only have required dependencies and access right in the image/container. |
| 45 | + |
| 46 | + |
| 47 | + |
| 48 | +>[!IMPORTANT] |
| 49 | +> To view more information about curated environment packages and versions, visit the Environments tab in the Azure Machine Learning [studio](./how-to-manage-environments-in-studio.md). |
| 50 | +
|
| 51 | + |
| 52 | + |
| 53 | +### Azure Container for PyTorch (ACPT) |
| 54 | + |
| 55 | + |
| 56 | + |
| 57 | + |
| 58 | +**Description**: The Azure Curated Environment for PyTorch is our latest PyTorch curated environment. It is optimized for large, distributed deep learning workloads and comes pre-packaged with the best of Microsoft technologies for accelerated training, e.g., OnnxRuntime Training (ORT), DeepSpeed, MSCCL, etc. |
| 59 | + |
| 60 | + |
| 61 | + |
| 62 | +The following configurations are supported: |
| 63 | + |
| 64 | + |
| 65 | + |
| 66 | +| Environment Name | OS | GPU Version| Python Version | PyTorch Version | ORT-training Version | DeepSpeed Version | torch-ort Version | |
| 67 | +| --- | --- | --- | --- | --- | --- | --- | --- | |
| 68 | +|acpt-pytorch-2.0-cuda11.7 | Ubuntu 20.04 | cu117|3.8| 2.0 | 1.14.1 | 0.8.2 | 0.15.1 |
| 69 | +|acpt-pytorch-1.13-cuda11.7 | Ubuntu 20.04 | cu117 | 3.8 | 1.13.1 | 1.14.0 | 0.8.0 | 1.14.0 | |
| 70 | +| acpt-pytorch-1.12-py39-cuda11.6 | Ubuntu 20.04 | cu116 | 3.9 | 1.12.1 | 1.13.1 | 0.7.3 | 1.13.1 | |
| 71 | +| acpt-pytorch-1.12-cuda11.6 | Ubuntu 20.04 | cu116 | 3.8 | 1.12.1 | 1.13.1 | 0.7.3 | 1.13.1 | |
| 72 | +|acpt-pytorch-1.11-cuda11.5 | Ubuntu 20.04 | cu115 | 3.8 | 1.11.0 | 1.11.1 | 0.7.3 | 1.11.0 | |
| 73 | +|acpt-pytorch-1.11-cuda11.5 | Ubuntu 20.04 | cu113 | 3.8 | 1.11.0 | 1.11.1 | 0.7.3 | 1.11.1 | |
| 74 | + |
| 75 | + |
| 76 | + |
| 77 | +Other packages like fairscale, horovod, msccl, protobuf, pyspark, pytest,pytorch-lightning, tensorboard, NebulaML, torchvision, torchmetrics to support all training needs |
| 78 | + |
| 79 | + |
| 80 | + |
| 81 | +## Support |
| 82 | +Version updates for supported environments, including the base images they reference, are released every two weeks to address vulnerabilities no older than 30 days. Based on usage, some environments may be deprecated (hidden from the product but usable) to support more common machine learning scenarios. |
| 83 | + |
| 84 | + |
| 85 | + |
| 86 | +## References |
| 87 | +https://learn.microsoft.com/en-us/azure/machine-learning/resource-curated-environments |
| 88 | + |
| 89 | + |
| 90 | + |
| 91 | +https://learn.microsoft.com/en-us/azure/machine-learning/data-science-virtual-machine/overview |
0 commit comments