Commit c963595

anhuong, Ssukriti, and aluu317 authored
build(deps): changes needed to support mamba/jamba model (#400)
* enable mamba deps
  Signed-off-by: Sukriti-Sharma4 <[email protected]>
* fix deps
  Signed-off-by: Sukriti-Sharma4 <[email protected]>
* try dockerfile
  Signed-off-by: Sukriti-Sharma4 <[email protected]>
* trial 2 dockerfile
  Signed-off-by: Sukriti-Sharma4 <[email protected]>
* dockerfile trial 3
  Signed-off-by: Sukriti-Sharma4 <[email protected]>
* install cudnn9-cuda-12 and mamba separately
  Signed-off-by: Anh Uong <[email protected]>
* deps: update transformers fork to 4.46
  Signed-off-by: Anh Uong <[email protected]>
* update to latest transformers as PR merged
  Signed-off-by: Sukriti-Sharma4 <[email protected]>
* clean up Dockerfile
  Signed-off-by: Sukriti-Sharma4 <[email protected]>
* test commit
  Signed-off-by: Sukriti-Sharma4 <[email protected]>
* install cudnn
  Signed-off-by: Sukriti-Sharma4 <[email protected]>
* test
  Signed-off-by: Sukriti-Sharma4 <[email protected]>
* remove extra dep
  Signed-off-by: Sukriti-Sharma4 <[email protected]>
* version cleanup
  Signed-off-by: Sukriti-Sharma4 <[email protected]>
* test
  Signed-off-by: Sukriti-Sharma4 <[email protected]>
* Try Joe's freeup-disk-space runner
  Signed-off-by: Angel Luu <[email protected]>
* test try for images to pass
  Signed-off-by: Sukriti-Sharma4 <[email protected]>
* Remove versioning for libcusparselt
  Signed-off-by: Angel Luu <[email protected]>

---------

Signed-off-by: Sukriti-Sharma4 <[email protected]>
Signed-off-by: Anh Uong <[email protected]>
Signed-off-by: Angel Luu <[email protected]>
Co-authored-by: Sukriti-Sharma4 <[email protected]>
Co-authored-by: Angel Luu <[email protected]>
Co-authored-by: Sukriti Sharma <[email protected]>
1 parent 66acb4f commit c963595

2 files changed: +14 -2 lines changed
build/Dockerfile

Lines changed: 13 additions & 2 deletions
@@ -88,7 +88,8 @@ ENV NV_CUDA_CUDART_DEV_VERSION=12.1.55-1 \
     NV_NVML_DEV_VERSION=12.1.55-1 \
     NV_LIBCUBLAS_DEV_VERSION=12.1.0.26-1 \
     NV_LIBNPP_DEV_VERSION=12.0.2.50-1 \
-    NV_LIBNCCL_DEV_PACKAGE_VERSION=2.18.3-1+cuda12.1
+    NV_LIBNCCL_DEV_PACKAGE_VERSION=2.18.3-1+cuda12.1 \
+    NV_CUDNN9_CUDA_VERSION=9.6.0.74-1
 
 RUN dnf config-manager \
    --add-repo https://developer.download.nvidia.com/compute/cuda/repos/rhel9/x86_64/cuda-rhel9.repo \
@@ -103,6 +104,15 @@ RUN dnf config-manager \
     libnccl-devel-${NV_LIBNCCL_DEV_PACKAGE_VERSION} \
     && dnf clean all
 
+# opening connection for too long in one go was resulting in timeouts
+RUN dnf config-manager \
+   --add-repo https://developer.download.nvidia.com/compute/cuda/repos/rhel9/x86_64/cuda-rhel9.repo \
+    && dnf clean packages \
+    && dnf install -y \
+    libcusparselt0 libcusparselt-devel \
+    cudnn9-cuda-12-6-${NV_CUDNN9_CUDA_VERSION} \
+    && dnf clean all
+
 ENV LIBRARY_PATH="$CUDA_HOME/lib64/stubs"
 
 FROM cuda-devel AS python-installations
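
The hunk above installs cuDNN and cuSPARSELt in their own `RUN` step. A quick way to sanity-check that step from inside the built image is to ask the dynamic loader whether it can see the libraries. This is only a sketch: the short names `cudnn` and `cusparseLt` are assumptions derived from the package names in the diff, and `find_library` may miss libraries that live outside the default search paths.

```python
from ctypes.util import find_library

# Assumed short library names (libcudnn.so, libcusparseLt.so); not taken from the commit.
CANDIDATE_LIBS = ("cudnn", "cusparseLt")


def locate_cuda_libs(names=CANDIDATE_LIBS):
    """Map each short library name to what the loader finds, or None if nothing is found."""
    return {name: find_library(name) for name in names}


if __name__ == "__main__":
    for name, found in locate_cuda_libs().items():
        print(f"{name}: {found or 'not found'}")
```

A `None` entry suggests the package did not install or is not on the loader path; it does not by itself prove the build step failed.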
@@ -138,7 +148,8 @@ RUN if [[ -z "${WHEEL_VERSION}" ]]; \
 RUN --mount=type=cache,target=/home/${USER}/.cache/pip,uid=${USER_UID} \
     python -m pip install --user wheel && \
     python -m pip install --user "$(head bdist_name)" && \
-    python -m pip install --user "$(head bdist_name)[flash-attn]"
+    python -m pip install --user "$(head bdist_name)[flash-attn]" && \
+    python -m pip install --user "$(head bdist_name)[mamba]"
 
 # fms_acceleration_peft = PEFT-training, e.g., 4bit QLoRA
 # fms_acceleration_foak = Fused LoRA and triton kernels
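
Because the `[mamba]` extra is installed as a separate pip step, it can be useful to confirm afterwards that the optional packages actually landed in the environment. The sketch below checks importability without importing anything (so it needs no GPU). The module names `mamba_ssm` and `causal_conv1d` are assumed to be the import names of the packages pulled in by the extra; the helper `missing_modules` is hypothetical and not part of this repository.

```python
from importlib.util import find_spec

# Assumed import names for the packages behind the "mamba" extra.
OPTIONAL_MODULES = ("mamba_ssm", "causal_conv1d")


def missing_modules(names=OPTIONAL_MODULES):
    """Return the top-level module names that cannot be located in this environment."""
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules()
    if missing:
        print("missing optional modules:", ", ".join(missing))
    else:
        print("all optional mamba modules are importable")
```

Using `find_spec` rather than a bare `import` keeps the check cheap and avoids loading CUDA extensions on machines that have none.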

pyproject.toml

Lines changed: 1 addition & 0 deletions
@@ -48,6 +48,7 @@ aim = ["aim>=3.19.0,<4.0"]
 mlflow = ["mlflow"]
 fms-accel = ["fms-acceleration>=0.6"]
 gptq-dev = ["auto_gptq>0.4.2", "optimum>=1.15.0"]
+mamba = ["mamba_ssm[causal-conv1d] @ git+https://github.com/state-spaces/mamba.git"]
 scanner-dev = ["HFResourceScanner>=0.1.0"]
 
 