Commit ff5f396

AMDGPU: Handle rewriting non-tied MFMA to AGPR form (#153015)
If src2 and dst aren't the same register, folding a copy to AGPR into the instruction also requires reassigning src2 to an available AGPR. All other uses of src2 must also be compatible with the AGPR replacement, to avoid inserting copies elsewhere. Perform this transform after verifying that all other uses are compatible with AGPR and that an AGPR is available at all points (which effectively means rewriting a full chain of MFMAs and loads/stores at once).
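
The check-then-rewrite shape described above can be sketched on a toy model. This is not the code from this commit: `VirtReg`, `Use`, `tryRewriteChain`, and the `freeAGPRs` count are all hypothetical names invented for illustration, and a simple count is only a crude stand-in for checking that an AGPR is available at every point. The sketch assumes the links between registers are recorded in both directions.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Toy model only: these types are hypothetical and do not exist in LLVM.
enum class RegClass { VGPR, AGPR };

struct Use {
    std::string opcode;  // instruction touching the register
    bool acceptsAGPR;    // whether that operand may legally be an AGPR
};

struct VirtReg {
    RegClass cls = RegClass::VGPR;
    std::vector<Use> uses;            // all defs and uses of this register
    std::vector<std::size_t> linked;  // registers that must be rewritten together
};

// Rewrite the whole chain reachable from `start` to AGPR, or change nothing.
// `freeAGPRs` is a crude stand-in for "an AGPR is available at all points".
bool tryRewriteChain(std::vector<VirtReg> &regs, std::size_t start,
                     std::size_t freeAGPRs) {
    assert(start < regs.size());
    std::vector<bool> seen(regs.size(), false);
    std::vector<std::size_t> worklist{start}, chain;
    seen[start] = true;

    // Phase 1: collect the chain and verify every use before touching anything.
    while (!worklist.empty()) {
        std::size_t r = worklist.back();
        worklist.pop_back();
        chain.push_back(r);
        for (const Use &u : regs[r].uses)
            if (!u.acceptsAGPR)
                return false;  // rewriting would force a copy elsewhere
        for (std::size_t next : regs[r].linked) {
            assert(next < regs.size());
            if (!seen[next]) {
                seen[next] = true;
                worklist.push_back(next);
            }
        }
    }
    if (chain.size() > freeAGPRs)
        return false;  // no AGPR available for some register in the chain

    // Phase 2: commit the reassignment for the full chain at once.
    for (std::size_t r : chain)
        regs[r].cls = RegClass::AGPR;
    return true;
}
```

Separating verification from mutation keeps the rewrite all-or-nothing: a single incompatible use anywhere in the chain leaves every register in its original class, which mirrors the stated goal of not introducing copies elsewhere.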
1 parent 3856bb6 commit ff5f396

3 files changed, +373 -168 lines changed

