[flang][rt] Add noinline attributes for CUDA compile path for successful compilation #161760

modiking · 2025-10-03T01:35:58Z

NVCC does more aggressive inlining than Clang/GCC causing the exported functions in extrema.cpp and findloc.cpp to become extremely large from function specializations leading to compilation timeouts. Marking the 2 functions in this change as noinline for NVCC alleviates this problem as it removes the worst of the cross-matrix argument specializations.

Also remove the workaround in #156542 that opted out findloc.cpp from the CUDA flang-rt build

Testing:
ninja flang-rt builds in ~30 minutes, these 2 files build in ~3 minutes

clementval · 2025-10-03T01:38:05Z

Thanks for the workaround @modiking. I added @klausler who is the main runtime developer as reviewer. Give him some time to chime in if he has anything to say.

github-actions · 2025-10-03T01:39:59Z

✅ With the latest revision this PR passed the C/C++ code formatter.

vzakhari · 2025-10-03T01:44:32Z

Thanks for the changes! Please use RT_DEVICE_NOINLINE macro instead. We have it defined in flang/include/flang/Common/api-attrs.h.

clementval

LGTM. Thanks

klausler

Please get Slava's approval before merging.

vzakhari

Thank you!

modiking · 2025-10-03T16:48:47Z

Appreciate the quick reviews!

modiking added 2 commits October 1, 2025 20:04

enable full flang cuda build

c953243

add comments

65c8c54

modiking requested a review from clementval October 3, 2025 01:35

clementval requested a review from klausler October 3, 2025 01:37

clementval requested a review from vzakhari October 3, 2025 01:41

use RT_DEVICE_NOINLINE and clang format

7c71531

clementval approved these changes Oct 3, 2025

View reviewed changes

klausler approved these changes Oct 3, 2025

View reviewed changes

vzakhari approved these changes Oct 3, 2025

View reviewed changes

modiking merged commit 74180eb into llvm:main Oct 3, 2025
9 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[flang][rt] Add noinline attributes for CUDA compile path for successful compilation #161760

[flang][rt] Add noinline attributes for CUDA compile path for successful compilation #161760

Uh oh!

modiking commented Oct 3, 2025

Uh oh!

clementval commented Oct 3, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Oct 3, 2025 •

edited

Loading

Uh oh!

vzakhari commented Oct 3, 2025

Uh oh!

clementval left a comment

Uh oh!

klausler left a comment

Uh oh!

vzakhari left a comment

Uh oh!

modiking commented Oct 3, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[flang][rt] Add noinline attributes for CUDA compile path for successful compilation #161760

[flang][rt] Add noinline attributes for CUDA compile path for successful compilation #161760

Uh oh!

Conversation

modiking commented Oct 3, 2025

Uh oh!

clementval commented Oct 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Oct 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

vzakhari commented Oct 3, 2025

Uh oh!

clementval left a comment

Choose a reason for hiding this comment

Uh oh!

klausler left a comment

Choose a reason for hiding this comment

Uh oh!

vzakhari left a comment

Choose a reason for hiding this comment

Uh oh!

modiking commented Oct 3, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

clementval commented Oct 3, 2025 •

edited

Loading

github-actions bot commented Oct 3, 2025 •

edited

Loading