Fix Evoformer compilation #7760
Signed-off-by: Santi Villalba <[email protected]>
…ystems Signed-off-by: Santi Villalba <[email protected]>
|
Hi @sdvillal By the way,
|
|
Thanks a lot for the quick review and merge @tohtana! I have fixed formatting (sorry about it, one should read the contributing guidelines before contributing...). I have not personally experienced the mismatch. We have been running on:
I could try running the test a few times in this context to see if it happens for me; would that information be useful? In any case, I feel the extension is showing its age: these GEMMs, and the extension in general, might need some love to make it worthwhile to use on Hopper and newer. |
`EvoformerAttnBuilder` has some problems which preclude compiling the extension in several scenarios (e.g., [isolated conda environment with CUDA toolchain](aqlaboratory/openfold-3#34), lack of hardware in the system) and break some standard DeepSpeed configuration of target capabilities.

*Changes*

- Fix evoformer CUTLASS detection:
  - Allow skipping it, which is useful when CUTLASS is already correctly set up (e.g., in a conda environment with CUTLASS and the CUDA toolchain)
  - Fix misleading use of the deprecated nvidia-cutlass PyPI package by actually using the provided bindings, while discouraging this route as [these bindings are not maintained anymore](NVIDIA/cutlass#2119)
- Fix evoformer compilation when no GPU is present:
  - this is handled correctly and more generally by `builder.compute_capability_args`
  - allow cross-compilation on systems without a GPU
  - allow compilation against all available virtual architectures and binary outputs
  - see e.g., #5308
- Make all these changes configurable and explicit through documented environment variables

Tested in all scenarios.

---------

Signed-off-by: Santi Villalba <[email protected]>
Co-authored-by: Masahiro Tanaka <[email protected]>
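To make the CUTLASS detection change more concrete, here is a minimal sketch of environment-driven detection with an opt-out. It is not the code from this PR: the function name and the `EXAMPLE_SKIP_CUTLASS_DETECTION` variable are hypothetical placeholders, and the only names taken from real projects are DeepSpeed's `CUTLASS_PATH` variable and the `include/cutlass/cutlass.h` header shipped in a CUTLASS checkout.

```python
from pathlib import Path
from typing import Mapping, Optional


def find_cutlass_include(
    env: Mapping[str, str],
    skip_var: str = "EXAMPLE_SKIP_CUTLASS_DETECTION",  # hypothetical name, not from the PR
) -> Optional[Path]:
    """Return the CUTLASS include directory, or None if detection is skipped.

    Illustrative only: the real builder may look in other places and use
    differently named environment variables.
    """
    # Opt-out: the toolchain (e.g. a conda environment) already exposes CUTLASS,
    # so no extra include path needs to be passed to the compiler.
    if env.get(skip_var) == "1":
        return None

    root = env.get("CUTLASS_PATH")
    if not root:
        raise RuntimeError("CUTLASS_PATH is not set and detection was not skipped")

    include_dir = Path(root) / "include"
    # Sanity check that the path really is a CUTLASS checkout.
    if not (include_dir / "cutlass" / "cutlass.h").is_file():
        raise RuntimeError(f"No cutlass/cutlass.h found under {include_dir}")
    return include_dir
```

Passing the environment in as a mapping (for example `os.environ`) keeps the helper easy to test without touching the real process environment.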
|
@sdvillal Yes, I encountered the issue with H100. The test doesn't throw an error with L40S on our CI. |
`EvoformerAttnBuilder` has some problems which preclude compiling the extension in several scenarios (e.g., isolated conda environment with CUDA toolchain, lack of hardware in the system) and break some standard DeepSpeed configuration of target capabilities.

Changes

- Fix evoformer CUTLASS detection
- Fix evoformer compilation when no GPU is present
- Make all these changes configurable and explicit through documented environment variables

Tested in all scenarios.
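As an illustration of what compiling "against all available virtual architectures and binary outputs" means without a GPU present, the sketch below turns a `TORCH_CUDA_ARCH_LIST`-style string into `nvcc` `-gencode` flags. It is a simplified stand-in, not DeepSpeed's `compute_capability_args`: the function name `gencode_flags` is hypothetical, and the parsing assumes the usual PyTorch convention of entries such as `8.0` or `9.0+PTX` separated by semicolons or spaces.

```python
from __future__ import annotations


def gencode_flags(arch_list: str) -> list[str]:
    """Translate e.g. "8.0;9.0+PTX" into nvcc -gencode flags.

    Each entry yields a binary (sm_XX) target; a "+PTX" suffix additionally
    embeds PTX for the virtual architecture (compute_XX) so newer GPUs can
    JIT-compile the kernels.
    """
    flags: list[str] = []
    for entry in arch_list.replace(" ", ";").split(";"):
        if not entry:
            continue  # tolerate duplicate or trailing separators
        wants_ptx = entry.endswith("+PTX")
        version = entry[: -len("+PTX")] if wants_ptx else entry
        major, minor = version.split(".")
        num = f"{major}{minor}"
        flags.append(f"-gencode=arch=compute_{num},code=sm_{num}")
        if wants_ptx:
            flags.append(f"-gencode=arch=compute_{num},code=compute_{num}")
    return flags
```

Because the target list comes from the environment rather than from querying a device, the same logic works on a build machine with no GPU, for example `TORCH_CUDA_ARCH_LIST="8.0;9.0+PTX"` when building wheels for Ampere and Hopper.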