Skip to content

Commit 88fc04c

Browse files
arsenmnzaghen
authored andcommitted
AMDGPU: Handle rewriting non-tied MFMA to AGPR form
If src2 and dst aren't the same register, to fold a copy to AGPR into the instruction we also need to reassign src2 to an available AGPR. All the other uses of src2 also need to be compatible with the AGPR replacement in order to avoid inserting other copies somewhere else. Perform this transform, after verifying all other uses are compatible with AGPR, and have an available AGPR available at all points (which effectively means rewriting a full chain of mfmas and load/store at once).
1 parent c3d7316 commit 88fc04c

File tree

2 files changed

+231
-83
lines changed

2 files changed

+231
-83
lines changed

0 commit comments

Comments
 (0)