Commit 771d023

Dockerfile: use base instead of cuda-runtime as base for server-release (#50)
The cuda-runtime target is ~2.3 GB, but it duplicates libraries that are already installed in the Python virtualenv when installing `torch` and the various `nvidia-` modules. By using base instead, we shave ~2.3 GB off the final image.

Signed-off-by: Daniele Trifirò <[email protected]>
1 parent f7d3c5f commit 771d023
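
The claim in the commit message (that the virtualenv already carries its own copy of the CUDA libraries) can be checked from inside the image. The following is a minimal sketch, not part of the commit: the helper names `dir_size_bytes` and `bundled_cuda_sizes` are hypothetical, and it assumes the pip-installed `nvidia-` wheels unpack under `site-packages/nvidia/` and that `torch` keeps its shared libraries in `site-packages/torch/lib/`.

```python
import os
import sysconfig
from pathlib import Path


def dir_size_bytes(root: Path) -> int:
    """Sum the sizes of all regular files under root (symlinks are skipped)."""
    total = 0
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = Path(dirpath) / name
            if path.is_file() and not path.is_symlink():
                total += path.stat().st_size
    return total


def bundled_cuda_sizes(site_packages: Path) -> dict[str, int]:
    """Return the on-disk size of each bundled library directory that exists.

    The two directory names are assumptions about where the wheels install
    their shared libraries; adjust them if your layout differs.
    """
    sizes = {}
    for candidate in ("nvidia", "torch/lib"):
        path = site_packages / candidate
        if path.is_dir():
            sizes[candidate] = dir_size_bytes(path)
    return sizes


if __name__ == "__main__":
    # Inspect the site-packages directory of the interpreter running this script.
    site_packages = Path(sysconfig.get_paths()["purelib"])
    for name, size in bundled_cuda_sizes(site_packages).items():
        print(f"{name}: {size / 1e9:.2f} GB")
```

Run with the virtualenv's interpreter inside the image. If the reported sizes are large, the CUDA libraries are already present in the virtualenv and a CUDA runtime base layer would mostly add a second copy of them.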

1 file changed: +1 -1 lines changed

Dockerfile

Lines changed: 1 addition & 1 deletion
@@ -323,7 +323,7 @@ RUN cp server/transformers_patch/modeling_codegen.py ${SITE_PACKAGES}/transforme
 
 
 ## Final Inference Server image ################################################
-FROM cuda-runtime as server-release
+FROM base as server-release
 ARG PYTHON_VERSION
 ARG SITE_PACKAGES=/opt/tgis/lib/python${PYTHON_VERSION}/site-packages
 
