Commit e3fa0b1

fix: remove cutlass-dsl stub that crashes TRT-LLM on CUDA 13.1 (#7372)
1 parent b1818dc commit e3fa0b1

1 file changed: +5 -1 lines changed

container/templates/trtllm_runtime.Dockerfile

Lines changed: 5 additions & 1 deletion
@@ -252,7 +252,11 @@ RUN --mount=type=bind,source=./container/deps/requirements.txt,target=/tmp/requi
         --extra-index-url https://download.pytorch.org/whl/cu130 \
         --requirement /tmp/requirements.txt \
         --requirement /tmp/requirements.test.txt \
-        cupy-cuda13x
+        cupy-cuda13x && \
+    # nvidia-cutlass-dsl-libs-base==4.4.1 (transitive dep) ships a stub cute/experimental/__init__.py
+    # that unconditionally raises NotImplementedError, crashing TRT-LLM on import. cutlass-dsl==4.3.4
+    # (pinned by TRT-LLM) works without cute/experimental/. Remove the stub to fix the NotImplementedError.
+    rm -rf ${VIRTUAL_ENV}/lib/python${PYTHON_VERSION}/site-packages/nvidia_cutlass_dsl/python_packages/cutlass/cute/experimental/

 # Copy tests, deploy and components for CI with correct ownership
 # Pattern: COPY --chmod=775 <path>; chmod g+w <path> done later as root because COPY --chmod only affects <path>/*, not <path>
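
The cleanup step in the hunk above could also be written as a small standalone helper, which makes it easier to try outside the image build. This is only a sketch under stated assumptions: the function name `remove_cutlass_stub` is hypothetical and not part of this commit, and the directory layout is taken from the `rm -rf` path in the diff. It removes the stub directory if it exists and returns non-zero if the path is still present afterwards.

```shell
# Hypothetical helper (not part of the commit): delete the stub
# cute/experimental/ directory under a given site-packages path.
# The relative path below mirrors the one used in the Dockerfile change.
remove_cutlass_stub() {
    site_packages="$1"
    stub_dir="${site_packages}/nvidia_cutlass_dsl/python_packages/cutlass/cute/experimental"

    if [ -d "${stub_dir}" ]; then
        rm -rf "${stub_dir}"
        echo "removed ${stub_dir}"
    else
        # Nothing to do, e.g. the transitive dependency no longer ships the stub.
        echo "no stub found at ${stub_dir}"
    fi

    # Succeed only if the stub path is gone.
    [ ! -e "${stub_dir}" ]
}

# Example call, assuming VIRTUAL_ENV and PYTHON_VERSION are set as in the Dockerfile:
#   remove_cutlass_stub "${VIRTUAL_ENV}/lib/python${PYTHON_VERSION}/site-packages"
```

Quoting the path and checking for the directory first keeps the helper safe to run more than once, which the bare `rm -rf` in the diff also tolerates.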
