@@ -258,8 +258,14 @@ AutoGluon Inference Containers
258
258
HuggingFace Training Containers
259
259
===============================
260
260
261
+ Please refer to the following page to view all available versions and tags for GPU containers:
262
+ * [ GPU Release Page] ( https://github.com/aws/deep-learning-containers/releases?q=huggingface-pytorch-training+AND+NOT+neuronx&expanded=true )
263
+
264
+ To get the latest one, you can check the Hugging Face [ documentation] ( https://huggingface.co/docs/sagemaker/en/dlcs/available#training ) .
265
+
261
266
| Framework | Job Type | CPU/GPU | Python Version Options | Example URL |
262
267
| -----------------------------------------------| -----------| -----------| -----------------------| -----------------------------------------------------------------------------------------------------------------------------------|
268
+ | PyTorch 2.5.1 with HuggingFace transformers | training | GPU | 3.11 (py311) | 763104351884.dkr.ecr.us-east-1.amazonaws.com/huggingface-pytorch-training:2.5.1-transformers4.49.0-gpu-py311-cu124-ubuntu22.04 |
263
269
| PyTorch 2.1.0 with HuggingFace transformers | training | GPU | 3.10 (py310) | 763104351884.dkr.ecr.us-east-1.amazonaws.com/huggingface-pytorch-training:2.1.0-transformers4.36.0-gpu-py310-cu121-ubuntu20.04 |
264
270
| PyTorch 2.0.0 with HuggingFace transformers | training | GPU | 3.10 (py310) | 763104351884.dkr.ecr.us-east-1.amazonaws.com/huggingface-pytorch-training:2.0.0-transformers4.28.1-gpu-py310-cu118-ubuntu20.04 |
265
271
| PyTorch 1.13.1 with HuggingFace transformers | training | GPU | 3.9 (py39) | 763104351884.dkr.ecr.us-east-1.amazonaws.com/huggingface-pytorch-training:1.13.1-transformers4.26.0-gpu-py39-cu117-ubuntu20.04 |
@@ -269,8 +275,15 @@ HuggingFace Training Containers
269
275
HuggingFace Inference Containers
270
276
===============================
271
277
278
+ Please refer to the following page to view all available versions and tags for GPU and CPU containers:
279
+ * [ GPU and CPU Release Page] ( https://github.com/aws/deep-learning-containers/releases?q=huggingface-pytorch-inference+AND+NOT+tgi+AND+NOT+neuronx&expanded=true )
280
+
281
+ To get the latest one, you can check the Hugging Face [ documentation] ( https://huggingface.co/docs/sagemaker/en/dlcs/available#pytorch-inference-dlc ) .
282
+
272
283
| Framework | Job Type | CPU/GPU | Python Version Options | Example URL |
273
284
| --------------------------------------------------| ------------| -----------| -----------------------| -----------------------------------------------------------------------------------------------------------------------------------|
285
+ | PyTorch 2.6.0 with HuggingFace transformers | inference | CPU | 3.12 (py312) | 763104351884.dkr.ecr.us-east-1.amazonaws.com/huggingface-pytorch-inference:2.6.0-transformers4.49.0-cpu-py312-ubuntu22.04 |
286
+ | PyTorch 2.6.0 with HuggingFace transformers | inference | GPU | 3.12 (py312) | 763104351884.dkr.ecr.us-east-1.amazonaws.com/huggingface-pytorch-inference:2.6.0-transformers4.49.0-gpu-py312-cu124-ubuntu22.04 |
274
287
| PyTorch 2.1.0 with HuggingFace transformers | inference | CPU | 3.10 (py310) | 763104351884.dkr.ecr.us-east-1.amazonaws.com/huggingface-pytorch-inference:2.1.0-transformers4.37.0-cpu-py310-ubuntu22.04 |
275
288
| PyTorch 2.1.0 with HuggingFace transformers | inference | GPU | 3.10 (py310) | 763104351884.dkr.ecr.us-east-1.amazonaws.com/huggingface-pytorch-inference:2.1.0-transformers4.37.0-gpu-py310-cu118-ubuntu20.04 |
276
289
| PyTorch 2.0.0 with HuggingFace transformers | inference | CPU | 3.10 (py310) | 763104351884.dkr.ecr.us-east-1.amazonaws.com/huggingface-pytorch-inference:2.0.0-transformers4.28.1-cpu-py310-ubuntu20.04 |
@@ -283,25 +296,51 @@ HuggingFace Inference Containers
283
296
HuggingFace Text Generation Inference (TGI) Containers
284
297
===============================
285
298
286
- Please refer to the following pages to view all available versions and tags for GPU and NeuronX containers:
299
+ Please refer to the following pages to view all available versions and tags for GPU containers:
287
300
* [ GPU Release Page] ( https://github.com/aws/deep-learning-containers/releases?q=tgi+AND+gpu&expanded=true )
301
+
302
+ To get the latest one, you can check the Hugging Face [ documentation] ( https://huggingface.co/docs/sagemaker/en/dlcs/available#llm-tgi ) .
303
+
304
+ HuggingFace Text Embeddings Inference (TEI) Containers
305
+ ===============================
306
+
307
+ To get the latest one, you can check the Hugging Face [ documentation] ( https://huggingface.co/docs/sagemaker/dlcs/available#text-embedding-inference ) .
308
+
309
+ HuggingFace Neuron Text Generation Inference (TGI) Containers
310
+ ===============================
311
+
312
+ Please refer to the following pages to view all available versions and tags for NeuronX containers:
288
313
* [ NeuronX Release Page] ( https://github.com/aws/deep-learning-containers/releases?q=tgi+AND+neuronx&expanded=true )
289
314
315
+ To get the latest one, you can check the Hugging Face [ documentation] ( https://huggingface.co/docs/optimum-neuron/en/containers#available-optimum-neuron-containers ) .
316
+
290
317
HuggingFace Neuron Inference Containers
291
318
===============================
292
319
320
+ Please refer to the following pages to view all available versions and tags for NeuronX containers:
321
+ * [ NeuronX Release Page] ( https://github.com/aws/deep-learning-containers/releases?q=huggingface-pytorch-inference-neuronx+AND+NOT+tgi&expanded=true )
322
+
323
+ To get the latest one, you can check the Hugging Face [ documentation] ( https://huggingface.co/docs/optimum-neuron/en/containers#available-optimum-neuron-containers ) .
324
+
293
325
| Framework | Neuron SDK Version | Job Type | Supported EC2 Instance Type | Python Version Options | Example URL |
294
326
| ------------------------------------------------------------------| -------------------| -----------| ----------------------------| -----------------------| ----------------------------------------------------------------------------------------------------------------------------------------------|
295
327
| PyTorch 1.10.2 with Neuron Inference and HuggingFace transformers | Neuron 1.19.1 | inference | inf1 | 3.7 (py37) | 763104351884.dkr.ecr.us-east-1.amazonaws.com/huggingface-pytorch-inference-neuron:1.10.2-transformers4.20.1-neuron-py37-sdk1.19.1-ubuntu18.04 |
296
328
| PyTorch 1.13.1 with NeuronX Inference and HuggingFace transformers | Neuron 2.15.0 | inference | inf2/trn1 | 3.10 (py310) | 763104351884.dkr.ecr.us-east-1.amazonaws.com/huggingface-pytorch-inference-neuronx:1.13.1-transformers4.34.1-neuronx-py310-sdk2.15.0-ubuntu20.04 |
297
329
| PyTorch 2.1.2 with NeuronX Inference and HuggingFace transformers | Neuron 2.18.0 | inference | inf2/trn1 | 3.10 (py310) | 763104351884.dkr.ecr.us-east-1.amazonaws.com/huggingface-pytorch-inference-neuronx:2.1.2-transformers4.36.2-neuronx-py310-sdk2.18.0-ubuntu20.04 |
330
+ | PyTorch 2.1.2 with NeuronX Inference and HuggingFace transformers | Neuron 2.20.0 | inference | inf2/trn1 | 3.10 (py310) | 763104351884.dkr.ecr.us-east-1.amazonaws.com/huggingface-pytorch-inference-neuronx:2.1.2-transformers4.43.2-neuronx-py310-sdk2.20.0-ubuntu20.04 |
298
331
299
332
HuggingFace Neuron Training Containers
300
333
===============================
301
334
335
+ Please refer to the following pages to view all available versions and tags for NeuronX containers:
336
+ * [ NeuronX Release Page] ( https://github.com/aws/deep-learning-containers/releases?q=huggingface-pytorch-training-neuronx+AND+NOT+tgi&expanded=true )
337
+
338
+ To get the latest one, you can check the Hugging Face [ documentation] ( https://huggingface.co/docs/optimum-neuron/en/containers#available-optimum-neuron-containers ) .
339
+
302
340
| Framework | Neuron SDK Version | Job Type | Supported EC2 Instance Type | Python Version Options | Example URL |
303
341
| ------------------------------------------------------------------| -------------------| -----------| ----------------------------| -----------------------| ----------------------------------------------------------------------------------------------------------------------------------------------|
304
342
| PyTorch 1.13.1 with NeuronX Training and HuggingFace transformers | Neuron 2.18.0 | training | trn1 | 3.10 (py310) | 763104351884.dkr.ecr.us-east-1.amazonaws.com/huggingface-pytorch-training-neuronx:1.13.1-transformers4.36.2-neuronx-py310-sdk2.18.0-ubuntu20.04 |
343
+ | PyTorch 2.1.2 with NeuronX Training and HuggingFace transformers | Neuron 2.20.0 | training | trn1 | 3.10 (py310) | 763104351884.dkr.ecr.us-east-1.amazonaws.com/huggingface-pytorch-training-neuronx:2.1.2-transformers4.48.1-neuronx-py310-sdk2.20.0-ubuntu20.04 |
305
344
306
345
StabilityAI Inference Containers
307
346
===============================
0 commit comments