Skip to content

Commit b82fc95

Browse files
committed
AMDGPU: Handle rewriting non-tied MFMA to AGPR form
If src2 and dst aren't the same register, to fold a copy to AGPR into the instruction we also need to reassign src2 to an available AGPR. All the other uses of src2 also need to be compatible with the AGPR replacement in order to avoid inserting other copies somewhere else. Perform this transform, after verifying all other uses are compatible with AGPR, and have an available AGPR available at all points (which effectively means rewriting a full chain of mfmas and load/store at once).
1 parent 4c13a1e commit b82fc95

File tree

2 files changed

+231
-83
lines changed

2 files changed

+231
-83
lines changed

0 commit comments

Comments
 (0)