[AMDGPU] Added hot-block-rematerialize pass #126331

adam-yang · 2025-02-08T01:27:39Z

Note: Before this is ready for review, I intend to remove AMDGPUMirSyncDependency and AMDGPUMirDivergenceAnalysis. They were basically forks of the IR versions of the passes from before the upstream MIR versions were available.

I have the code path that uses MachineUniformityInfo and the divergence related unit test passes with it, but I'd like to do more testing before deleting the other ones.

github-actions · 2025-02-08T01:30:59Z

✅ With the latest revision this PR passed the undef deprecator.

github-actions · 2025-02-08T01:30:59Z

✅ With the latest revision this PR passed the C/C++ code formatter.

dstutt · 2025-02-26T17:31:27Z

Added Jay and Carl fyi (still needs cleanup)

perlfu · 2025-02-27T05:23:34Z

Since this is only a draft and MirDivergence parts are going away I only I had a quick look.
I wonder if parts like AMDGPULatencyTracker should be full analysis passes, rather than just utilities?

jayfoad · 2025-02-27T11:52:30Z

llvm/lib/Target/AMDGPU/AMDGPUOccupancyAndLatencyHelper.cpp

Need to #include <cmath> for this. But perhaps it would be better to use an integer-only power operation (not sure if we already have one anywhere in LLVM).

jayfoad · 2025-02-27T12:00:50Z

Is this intended to be run just before pre-RA MachineScheduler?

Update: I see the pass currently requires SSA form, so it would have to be run earlier than that. Where would you put it in the pass pipeline?

dstutt · 2025-02-27T16:10:07Z

We've got some work in the pipeline to allow pre-RA machinescheduler to run in SSA form as well, so that might be ok.
Also, this might work well with some other work we're considering to add spilling as a separate pass before RA (but in SSA form).

…t I don't see any reason why we can't just require SSA

jayfoad · 2025-03-25T10:08:44Z

Hi @adam-yang, I've done some testing with this patch and noticed that options -amdgpu-remat-enable-hot-block-remat-aggressive and -amdgpu-remat-enable-sub-exp-remat seem to cause many crashes in the compiler. Are you interested in test cases for these? Or are these options not expected to be working yet?

adam-yang · 2025-03-25T18:37:46Z

Hi @adam-yang, I've done some testing with this patch and noticed that options -amdgpu-remat-enable-hot-block-remat-aggressive and -amdgpu-remat-enable-sub-exp-remat seem to cause many crashes in the compiler. Are you interested in test cases for these? Or are these options not expected to be working yet?

Yes could you send them to me please.

jayfoad · 2025-03-26T11:55:47Z

OK, here's a somewhat reduced test case for a crash with -amdgpu-remat-enable-sub-exp-remat. The verifier reports "Bad machine code: Found PHI instruction after non-PHI".
r1.txt

jayfoad · 2025-03-26T13:31:20Z

Here's one for -amdgpu-remat-enable-hot-block-remat-aggressive which fails an assertion "Invalid RC for virtual register": r2.txt

adam-yang · 2025-03-27T19:27:04Z

Here's one for -amdgpu-remat-enable-hot-block-remat-aggressive which fails an assertion "Invalid RC for virtual register": r2.txt

This revealed some incorrect assumptions in this PR. Lanes are assumed to be 32-bit, but new 16-bit subregs have been added so lanes are now 16-bit. I'll have to go through the change to fix some more issues related to this.

dstutt requested review from jayfoad and perlfu February 26, 2025 17:30

dstutt self-requested a review February 26, 2025 17:31

jayfoad reviewed Feb 27, 2025

View reviewed changes

adam-yang and others added 19 commits March 17, 2025 20:24

Added rematerialize pass and test.

b6eb3b3

Fixed build, and added simple tests that exercise major code paths

7739842

Test renames, only keeping the required flags for the tests

3539ab3

Using the mir uniformity analysis instead, which DOES require SSA; bu…

a33f944

…t I don't see any reason why we can't just require SSA

In block remat AND making v to s slightly more robust

2215b79

clang-format

d36a4ae

Added option to enable it in the target profile

bf396df

Fix PHI node handling in regpressure tracker

c64c4e4

Fixed the PHI issue

3dc22d4

Removed old forks of things

29eca4a

Clang format and warnings.

6b011fb

First batch of formatting changes

eb4f8c1

Batch 2

78ab7f3

More cleanup

d8b6711

More cleanups

f8eb7fb

Possibly the last batch of cleanup

0600e2f

Additional cleanup + format

84d8dd8

Added cmath

303a401

Wrong place for std header

971e556

adam-yang force-pushed the amdgpu_hot_block_remat branch from ede9e76 to 971e556 Compare March 18, 2025 03:24

adam-yang added 3 commits March 17, 2025 20:35

Made getMinimalSpanningSubRegIdxSetForLaneMask local

be03462

Fixed build break after rebase

436058b

Clang format

9dbab90

adam-yang marked this pull request as ready for review March 18, 2025 05:21

adam-yang added 2 commits March 18, 2025 11:43

Fixing undef deprecator failures

ebcbb24

Ran latest format

b5d143c

Fixed failing tests, and added tests

87d9404

adam-yang mentioned this pull request Apr 21, 2025

[AMDGPU] Added hot-block-rematerialize pass #136631

Open

[AMDGPU] Added hot-block-rematerialize pass #126331

Are you sure you want to change the base?

[AMDGPU] Added hot-block-rematerialize pass #126331

Uh oh!

Conversation

adam-yang commented Feb 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Feb 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Feb 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dstutt commented Feb 26, 2025

Uh oh!

perlfu commented Feb 27, 2025

Uh oh!

jayfoad Feb 27, 2025

Choose a reason for hiding this comment

Uh oh!

jayfoad commented Feb 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dstutt commented Feb 27, 2025

Uh oh!

jayfoad commented Mar 25, 2025

Uh oh!

adam-yang commented Mar 25, 2025

Uh oh!

jayfoad commented Mar 26, 2025

Uh oh!

jayfoad commented Mar 26, 2025

Uh oh!

adam-yang commented Mar 27, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

adam-yang commented Feb 8, 2025 •

edited

Loading

github-actions bot commented Feb 8, 2025 •

edited

Loading

github-actions bot commented Feb 8, 2025 •

edited

Loading

jayfoad commented Feb 27, 2025 •

edited

Loading