Skip to content
This repository was archived by the owner on Sep 4, 2025. It is now read-only.

Commit 3be261c

Browse files
committed
Dockerfile.ubi: remove leftover flash-attn references
1 parent bd23984 commit 3be261c

File tree

1 file changed

+0
-4
lines changed

1 file changed

+0
-4
lines changed

Dockerfile.ubi

Lines changed: 0 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -240,10 +240,6 @@ RUN pip install \
240240
mv /root/.config/vllm/nccl/cu12/libnccl.so.2.18.1 /opt/vllm/lib/ && \
241241
chmod 0755 /opt/vllm/lib/libnccl.so.2.18.1
242242

243-
# Install flash attention (from pre-built wheel)
244-
RUN --mount=type=bind,from=flash-attn-builder,src=/usr/src/flash-attention-v2,target=/usr/src/flash-attention-v2 \
245-
pip install /usr/src/flash-attention-v2/*.whl --no-cache-dir
246-
247243
RUN --mount=type=cache,target=/root/.cache/pip \
248244
pip install \
249245
# additional dependencies for the TGIS gRPC server

0 commit comments

Comments
 (0)