Commit 21e2b8b

[trtllm] Update available_images.md with TRT-LLM images and set force_release for TRT LLM container to False (#4968)
* [trtllm] Set force_release for 0.33.0 djl-serving container to False
* [trtllm] Update available_images.md with djl-serving 0.33.0 images
1 parent dea45f8 commit 21e2b8b

2 files changed: 3 additions, 1 deletion

available_images.md

Lines changed: 2 additions & 0 deletions
@@ -212,6 +212,8 @@ Starting LMI V10 (0.28.0), we are changing the name from LMI DeepSpeed DLC to LM

 | Framework | Job Type | Accelerator | Python Version Options | Example URL |
 |-----------------------------------------------------------------------------------------------------------------------------|-----------|-------------|------------------------|-------------------------------------------------------------------------------------------|
+| DJLServing 0.33.0 with LMI Dist 15.0.0, vLLM 0.8.4, HuggingFace Transformers 4.51.3, and HuggingFace Accelerate 1.0.1 | inference | GPU | 3.12 (py312) | 763104351884.dkr.ecr.us-west-2.amazonaws.com/djl-inference:0.33.0-lmi15.0.0-cu128 |
+| DJLServing 0.33.0 with TensorRT-LLM 0.21.0rc1, HuggingFace Transformers 4.51.3, and HuggingFace Accelerate 1.0.1 | inference | GPU | 3.12 (py312) | 763104351884.dkr.ecr.us-west-2.amazonaws.com/djl-inference:0.33.0-tensorrtllm0.21.0-cu128 |
 | DJLServing 0.32.0 with LMI Dist 13.0.0, vLLM 0.7.1, HuggingFace Transformers 4.45.2, and HuggingFace Accelerate 1.0.1 | inference | GPU | 3.11 (py311) | 763104351884.dkr.ecr.us-west-2.amazonaws.com/djl-inference:0.32.0-lmi14.0.0-cu126 |
 | DJLServing 0.32.0 with TensorRT-LLM 0.12.0, HuggingFace Transformers 4.44.2, and HuggingFace Accelerate 0.32.1 | inference | GPU | 3.10 (py310) | 763104351884.dkr.ecr.us-west-2.amazonaws.com/djl-inference:0.32.0-tensorrtllm0.12.0-cu125 |
 | DJLServing 0.31.0 with LMI Dist 13.0.0, vLLM 0.6.3.post1, HuggingFace Transformers 4.45.2, and HuggingFace Accelerate 1.0.1 | inference | GPU | 3.11 (py311) | 763104351884.dkr.ecr.us-west-2.amazonaws.com/djl-inference:0.31.0-lmi13.0.0-cu124 |
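
The Example URL column in the rows above appears to follow a regular shape: an ECR registry host, the `djl-inference` repository, and a tag of the form `<djl version>-<backend><backend version>-<cuda>`. The sketch below splits such a URI into its parts. The pattern is inferred only from the rows in this hunk and is not a documented naming contract; `DjlImage` and `parse_image_uri` are hypothetical names introduced for illustration.

```python
import re
from typing import NamedTuple


class DjlImage(NamedTuple):
    """Parts of a djl-inference image URI (hypothetical helper type)."""
    account: str
    region: str
    repository: str
    djl_version: str
    backend: str
    backend_version: str
    cuda: str


# Assumption: the tag layout is inferred from the example URLs in the table,
# e.g. "0.33.0-tensorrtllm0.21.0-cu128". Other tags may not match.
_URI_RE = re.compile(
    r"^(?P<account>\d{12})\.dkr\.ecr\.(?P<region>[a-z0-9-]+)\.amazonaws\.com/"
    r"(?P<repository>[a-z0-9._/-]+):"
    r"(?P<djl_version>\d+\.\d+\.\d+)-"
    r"(?P<backend>[a-z]+)(?P<backend_version>\d+\.\d+\.\d+)-"
    r"(?P<cuda>cu\d+)$"
)


def parse_image_uri(uri: str) -> DjlImage:
    """Split an image URI into registry, repository, and tag components."""
    match = _URI_RE.match(uri)
    if match is None:
        raise ValueError(f"unrecognized image URI: {uri!r}")
    return DjlImage(**match.groupdict())


if __name__ == "__main__":
    image = parse_image_uri(
        "763104351884.dkr.ecr.us-west-2.amazonaws.com/"
        "djl-inference:0.33.0-tensorrtllm0.21.0-cu128"
    )
    print(image.backend, image.backend_version, image.cuda)
```

Note that the parsed `backend_version` reflects the tag only: the new TensorRT-LLM row describes the framework as 0.21.0rc1 while its tag carries `tensorrtllm0.21.0`, and the 0.32.0 LMI row says LMI Dist 13.0.0 while its tag carries `lmi14.0.0`.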

release_images_inference.yml

Lines changed: 1 addition & 1 deletion
@@ -195,7 +195,7 @@ release_images:
 cuda_version: "cu128"
 example: False
 disable_sm_tag: True
-force_release: True
+force_release: False
 16:
 framework: "djl"
 version: "0.32.0"
