Skip to content

[Bug Fix]Fix undefined symbol : cuDriverGetVersion error after updating CUTLASS#2116

Closed
HydraQYH wants to merge 1 commit intoDao-AILab:mainfrom
HydraQYH:fix_cuDriverGetVersion_undefined
Closed

[Bug Fix]Fix undefined symbol : cuDriverGetVersion error after updating CUTLASS#2116
HydraQYH wants to merge 1 commit intoDao-AILab:mainfrom
HydraQYH:fix_cuDriverGetVersion_undefined

Conversation

@HydraQYH
Copy link
Contributor

@HydraQYH HydraQYH commented Dec 31, 2025

I saw that #2108 updated CUTLASS.
After the CUTLASS update, FA3 encountered the following issues:
FA3 Bug
The reason is that new CUTLASS uses the cuDriverGetVersion API:
https://github.com/NVIDIA/cutlass/blame/853ad93d60b23b4f87bc46dfbc3c9ce757773ed7/include/cute/atom/copy_traits_sm90_tma.hpp#L1056
This symbol is located in libcuda.so, but this library is not linked. So I added a extra_link_args to fix this bug.
@tridao

@HydraQYH
Copy link
Contributor Author

HydraQYH commented Jan 4, 2026

@tridao @drisspg @johnnynunez Hello, can you take a look?

@johnnynunez
Copy link
Contributor

@tridao @drisspg @johnnynunez Hello, can you take a look?

@drisspg @tridao maybe is related with this issue also apart from tvm ffi

sgl-project/sglang#15182
@Qiaolin-Yu @Fridge003

@johnnynunez
Copy link
Contributor

I saw that #2108 updated CUTLASS. After the CUTLASS update, FA3 encountered the following issues: FA3 Bug The reason is that new CUTLASS uses the cuDriverGetVersion API: https://github.com/NVIDIA/cutlass/blame/853ad93d60b23b4f87bc46dfbc3c9ce757773ed7/include/cute/atom/copy_traits_sm90_tma.hpp#L1056 This symbol is located in libcuda.so, but this library is not linked. So I added a extra_link_args to fix this bug. @tridao

Is this occuring in fa cute dsl also? @HydraQYH flash-attention/cute/

@drisspg
Copy link
Collaborator

drisspg commented Jan 4, 2026

I would need to look deeper at this issue first, but we should not link against driver Apis. Cutlass has a mechanism for dynamic loading at runtime the unresolved symbols and if something new was added that broke this that is a bug in cutlass.

@johnnynunez
Copy link
Contributor

thanks @drisspg reported to cutlass team

@johnnynunez
Copy link
Contributor

I saw that #2108 updated CUTLASS. After the CUTLASS update, FA3 encountered the following issues: FA3 Bug The reason is that new CUTLASS uses the cuDriverGetVersion API: https://github.com/NVIDIA/cutlass/blame/853ad93d60b23b4f87bc46dfbc3c9ce757773ed7/include/cute/atom/copy_traits_sm90_tma.hpp#L1056 This symbol is located in libcuda.so, but this library is not linked. So I added a extra_link_args to fix this bug. @tridao

A fix is coming in v4.4.0.dev @drisspg @tridao @HydraQYH
You can close it

@depaulmillz
Copy link

This PR to CUTLASS should fix the issue. It switches to the runtime API instead of the driver API - NVIDIA/cutlass#2928

@HydraQYH
Copy link
Contributor Author

HydraQYH commented Jan 5, 2026

I saw that #2108 updated CUTLASS. After the CUTLASS update, FA3 encountered the following issues: FA3 Bug The reason is that new CUTLASS uses the cuDriverGetVersion API: https://github.com/NVIDIA/cutlass/blame/853ad93d60b23b4f87bc46dfbc3c9ce757773ed7/include/cute/atom/copy_traits_sm90_tma.hpp#L1056 This symbol is located in libcuda.so, but this library is not linked. So I added a extra_link_args to fix this bug. @tridao

A fix is coming in v4.4.0.dev @drisspg @tridao @HydraQYH You can close it

@johnnynunez Ok. I will close this PR after @depaulmillz 's PR is merged and FA updates CUTLASS.

@HydraQYH
Copy link
Contributor Author

HydraQYH commented Jan 6, 2026

Move to #2142

@HydraQYH HydraQYH closed this Jan 6, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants