|
| 1 | +# Available DLCs on AWS |
| 2 | + |
| 3 | +Below you can find a listing of all the Deep Learning Containers (DLCs) available on AWS. |
| 4 | + |
| 5 | +A container is published for each supported combination of use case (training, inference), accelerator type (CPU, GPU, Neuron), and framework (PyTorch, TGI, TEI). |
| 6 | + |
| 7 | +## FAQ |
| 8 | + |
| 9 | +**How do I choose the right container for my use case?** |
| 10 | + |
| 11 | +**How do I find the URI of my container?** |
| 12 | +Each URI contains an AWS account ID and an AWS region. Replace both values to match your use case. |
| 13 | +Let's say you want to use the training DLC for GPUs in |
| 14 | +- `dlc-aws-account-id`: The AWS account ID of the account that owns the ECR repository. You can find the account IDs [here](https://github.com/aws/sagemaker-python-sdk/blob/e0b9d38e1e3b48647a02af23c4be54980e53dc61/src/sagemaker/image_uri_config/huggingface.json#L21). |
| 15 | +- `region`: The AWS region where you want to use it. |
| 16 | + |
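As a minimal sketch of how the two placeholders fit into a URI, the helper below assembles one from its parts. The function name `build_dlc_uri` and its parameter names are hypothetical; the URI layout is inferred from the container URIs listed on this page, and the example values are copied from the training table.

```python
def build_dlc_uri(account_id: str, region: str, repository: str, tag: str) -> str:
    """Assemble an ECR image URI from its four parts.

    Layout assumed from the URIs listed on this page:
    <account-id>.dkr.ecr.<region>.amazonaws.com/<repository>:<tag>
    """
    return f"{account_id}.dkr.ecr.{region}.amazonaws.com/{repository}:{tag}"


# Example values copied from the GPU training row of this listing.
uri = build_dlc_uri(
    account_id="763104351884",
    region="us-east-1",
    repository="huggingface-pytorch-training",
    tag="2.5.1-transformers4.49.0-gpu-py311-cu124-ubuntu22.04",
)
print(uri)
```

When targeting another region, pass that region and the matching account ID; the account ID may differ between regions, so check the file linked above rather than assuming it stays the same.
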
| 17 | +## Training |
| 18 | + |
| 19 | +PyTorch Training DLC: For training, our DLCs are available for PyTorch via 🤗 Transformers. They include support for training on GPUs and AWS AI chips with libraries such as 🤗 TRL, Sentence Transformers, or 🧨 Diffusers. |
| 20 | + |
| 21 | +| Container URI | Accelerator | |
| 22 | +| -------------------------------------------------------------------------------------------------------------------------------- | ----------- | |
| 23 | +| 763104351884.dkr.ecr.us-east-1.amazonaws.com/huggingface-pytorch-training:2.5.1-transformers4.49.0-gpu-py311-cu124-ubuntu22.04 | GPU | |
| 24 | +| 763104351884.dkr.ecr.us-east-1.amazonaws.com/huggingface-pytorch-training-neuronx:2.1.2-transformers4.48.1-neuronx-py310-sdk2.20.0-ubuntu20.04 | Neuron | |
| 25 | + |
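The image tags in the table above pack several versions into one hyphen-separated string. The sketch below splits a tag into its components. The helper name `parse_dlc_tag` and the output keys are hypothetical, and reading the leading number as the framework version is an assumption based on the repository names, not a documented guarantee.

```python
import re

# Device markers as they appear in the tags on this page.
DEVICES = {"cpu", "gpu", "neuronx"}


def parse_dlc_tag(tag: str) -> dict:
    """Split a DLC image tag into labelled components.

    Assumption: the first component is the framework version and every
    other component is either a device marker or '<name><version>'.
    """
    parts = tag.split("-")
    info = {"framework_version": parts[0]}
    for part in parts[1:]:
        if part in DEVICES:
            info["device"] = part
            continue
        # e.g. 'transformers4.49.0' -> ('transformers', '4.49.0')
        match = re.fullmatch(r"([a-z]+)(\d[\d.]*)", part)
        if match:
            info[match.group(1)] = match.group(2)
    return info


gpu_info = parse_dlc_tag("2.5.1-transformers4.49.0-gpu-py311-cu124-ubuntu22.04")
print(gpu_info)
```

Components that match neither pattern are silently skipped, which keeps the sketch tolerant of tag formats not shown in this listing.
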
| 26 | + |
| 27 | +## Inference |
| 28 | + |
| 29 | +### PyTorch Inference DLC |
| 30 | + |
| 31 | +For inference, there is a general-purpose PyTorch inference DLC for serving models trained with any of the frameworks mentioned above on CPU, GPU, and AWS AI chips. |
| 32 | + |
| 33 | +| Container URI | Accelerator | |
| 34 | +| -------------------------------------------------------------------------------------------------------------------------------- | ----------- | |
| 35 | +| 763104351884.dkr.ecr.us-east-1.amazonaws.com/huggingface-pytorch-inference:2.6.0-transformers4.49.0-cpu-py312-ubuntu22.04 | CPU | |
| 36 | +| 763104351884.dkr.ecr.us-east-1.amazonaws.com/huggingface-pytorch-inference:2.6.0-transformers4.49.0-gpu-py312-cu124-ubuntu22.04 | GPU | |
| 37 | +| 763104351884.dkr.ecr.us-east-1.amazonaws.com/huggingface-pytorch-inference-neuronx:2.1.2-transformers4.43.2-neuronx-py310-sdk2.20.0-ubuntu20.04 | Neuron | |
| 38 | + |
| 39 | +### Text Generation Inference |
| 40 | + |
| 41 | +There is also the Text Generation Inference (TGI) DLC for high-performance text generation with LLMs on GPU and AWS AI chips. |
| 42 | + |
| 43 | +| Container URI | Accelerator | |
| 44 | +| -------------------------------------------------------------------------------------------------------------------------------- | ----------- | |
| 45 | +| 763104351884.dkr.ecr.us-east-1.amazonaws.com/huggingface-pytorch-tgi-inference:2.6.0-tgi3.2.3-gpu-py311-cu124-ubuntu22.04 | GPU | |
| 46 | +| 763104351884.dkr.ecr.us-east-1.amazonaws.com/huggingface-pytorch-tgi-inference:2.1.2-optimum0.0.28-neuronx-py310-ubuntu22.04 | Neuron | |
| 47 | + |
| 48 | +### Text Embeddings Inference |
| 49 | + |
| 50 | +Finally, there is a Text Embeddings Inference (TEI) DLC for high-performance serving of embedding models on CPU and GPU. |
| 51 | + |
| 52 | +| Container URI | Accelerator | |
| 53 | +| -------------------------------------------------------------------------------------------------------------------------------- | ----------- | |
| 54 | +| 683313688378.dkr.ecr.us-east-1.amazonaws.com/tei-cpu:2.0.1-tei1.2.3-cpu-py310-ubuntu22.04 | CPU | |
| 55 | +| 683313688378.dkr.ecr.us-east-1.amazonaws.com/tei:2.0.1-tei1.4.0-gpu-py310-cu122-ubuntu22.04 | GPU | |