-
Notifications
You must be signed in to change notification settings - Fork 65
build(deps): changes needed to support mamba/jamba model #400
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Signed-off-by: Sukriti-Sharma4 <[email protected]>
Signed-off-by: Sukriti-Sharma4 <[email protected]>
Signed-off-by: Sukriti-Sharma4 <[email protected]>
Signed-off-by: Sukriti-Sharma4 <[email protected]>
Signed-off-by: Sukriti-Sharma4 <[email protected]>
Signed-off-by: Anh Uong <[email protected]>
Signed-off-by: Anh Uong <[email protected]>
|
Thanks for making a pull request! 😃 |
Signed-off-by: Sukriti-Sharma4 <[email protected]>
Signed-off-by: Sukriti-Sharma4 <[email protected]>
Signed-off-by: Sukriti-Sharma4 <[email protected]>
Signed-off-by: Sukriti-Sharma4 <[email protected]>
Signed-off-by: Sukriti-Sharma4 <[email protected]>
Signed-off-by: Sukriti-Sharma4 <[email protected]>
|
@anhuong just wanted to point out that bamba is now released so no need to use my forks anymore |
|
ya thanks @fabianlim . I have updated transformers. Just need to clean up the CUDNN versions tomorrow and then should be ready |
Signed-off-by: Sukriti-Sharma4 <[email protected]>
Signed-off-by: Sukriti-Sharma4 <[email protected]>
Signed-off-by: Sukriti-Sharma4 <[email protected]>
Signed-off-by: Angel Luu <[email protected]>
Signed-off-by: Sukriti-Sharma4 <[email protected]>
Signed-off-by: Sukriti-Sharma4 <[email protected]>
|
installing Strange part is sometimes randomly , actually only once the Travis build passed and I was able to build an image. The download links are correct for these CUDA libraries. These failures are only happening on this branch and not on main @aluu317 was able to build image locally and verified it has all deps installed. It also builds on Github . No idea whats happening with Travis builds |
Signed-off-by: Angel Luu <[email protected]>
Ssukriti
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you @aluu317 for trying different things on Travis. We now have passing builds, though we arent sure why fixing version of libcudparselt was causing it to fail earlier. We will keep an eye and if failures resume, we will post on the Travis channel. We can merge for now probably
Description of the change
In order to support Mamba2ForCausalLM and JambaForCausalLM models, we needed to install deps mamba_ssm and transformers with changes from Fabian's fork. In order to install mamba_ssm, we needed package
cudnn9-cuda-12otherwise we hit errorImportError: libcudnn.so.9: cannot open shared object file: No such file or directory. Then, causal-conv1d failed to install from mamba_ssm due toModuleNotFoundError: No module named 'torch'which is why we are installing mamba_ssm as a separate dependency after the base deps are installed.Related issue number
How to verify the PR
Built image with these changes and was able to run tuning on Mamba and Jamba models.
Was the PR tested