Skip to content

Commit 0dfb7c1

Browse files
kevalmorabia97yeyu-nvidia
authored andcommitted
Update dockerfile to TensorRT-LLM 0.17
Signed-off-by: Keval Morabia <[email protected]>
1 parent f21ca50 commit 0dfb7c1

File tree

2 files changed

+5
-4
lines changed

2 files changed

+5
-4
lines changed

CHANGELOG.rst

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -6,6 +6,7 @@ Model Optimizer Changelog (Linux)
66

77
**Backward Breaking Changes**
88

9+
- Support TensorRT-LLM to 0.17. Examples (e.g. benchmark task in llm_ptq) may not be fully compatible with TensorRT-LLM 0.15.
910
- Nvidia TensorRT Model Optimizer has changed its LICENSE from NVIDIA Proprietary (library wheel) and MIT (examples) to Apache 2.0 in this first full OSS release.
1011
- Deprecate Python 3.8, Torch 2.0, and Cuda 11.x support.
1112
- ONNX Runtime dependency upgraded to 1.20 which no longer supports Python 3.9.

docker/Dockerfile

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,4 @@
1-
FROM nvidia/cuda:12.6.2-devel-ubuntu22.04
1+
FROM nvidia/cuda:12.8.0-devel-ubuntu22.04
22

33
WORKDIR /workspace
44

@@ -13,7 +13,7 @@ RUN rm -rf /usr/lib/python3/dist-packages/setuptools*
1313
RUN pip install setuptools -U
1414

1515
# Install TensorRT-LLM
16-
ARG TRT_LLM_VERSION=0.15.0
16+
ARG TRT_LLM_VERSION=0.17.0
1717
RUN pip install "tensorrt-llm~=$TRT_LLM_VERSION" -U
1818
RUN git clone --depth 1 --branch "v$TRT_LLM_VERSION" https://github.com/NVIDIA/TensorRT-LLM.git && \
1919
mkdir tensorrt-llm && \
@@ -26,7 +26,7 @@ ENV LD_LIBRARY_PATH=/usr/local/lib/python3.10/dist-packages/tensorrt_llm/libs:$L
2626
ENV LD_LIBRARY_PATH=/usr/local/lib/python3.10/dist-packages/nvidia/cudnn/lib:$LD_LIBRARY_PATH
2727

2828
# Install TensorRT dev environment
29-
ARG TENSORRT_URL=https://developer.nvidia.com/downloads/compute/machine-learning/tensorrt/10.6.0/tars/TensorRT-10.6.0.26.Linux.x86_64-gnu.cuda-12.6.tar.gz
29+
ARG TENSORRT_URL=https://developer.nvidia.com/downloads/compute/machine-learning/tensorrt/10.8.0/tars/TensorRT-10.8.0.43.Linux.x86_64-gnu.cuda-12.8.tar.gz
3030
RUN wget -q -O tensorrt.tar.gz $TENSORRT_URL && \
3131
tar -xf tensorrt.tar.gz && \
3232
cp TensorRT-*/bin/trtexec /usr/local/bin && \
@@ -39,7 +39,7 @@ ENV LD_LIBRARY_PATH=$TRT_LIB_PATH:$LD_LIBRARY_PATH
3939

4040
# Install modelopt with all optional dependencies and pre-compile CUDA extensions otherwise they take several minutes on every docker run
4141
RUN pip install "nvidia-modelopt[all]" -U
42-
ENV TORCH_CUDA_ARCH_LIST="5.2 6.0 6.1 7.0 7.2 7.5 8.0 8.6 8.7 9.0+PTX"
42+
ENV TORCH_CUDA_ARCH_LIST="8.0 8.6 8.7 8.9 9.0+PTX"
4343
RUN python -c "import modelopt.torch.quantization.extensions as ext; ext.precompile()"
4444

4545
# Find and install requirements.txt files for all examples excluding windows

0 commit comments

Comments
 (0)