[AMDGPU] Fix AGPR_32 reg assign for mfma scale ops #168964

hjagasiaAMD · 2025-11-20T22:29:06Z

In MFMA rewrite pass, prevent AGPR_32 reg class assignment for scale operands, not permitted by instruction format.

llvmbot · 2025-11-20T22:29:43Z

@llvm/pr-subscribers-backend-amdgpu

Author: None (hjagasiaAMD)

Changes

In MFMA rewrite pass, prevent AGPR_32 reg class assignment for scale operands, not permitted by instruction format.

Full diff: https://github.com/llvm/llvm-project/pull/168964.diff

2 Files Affected:

(modified) llvm/lib/Target/AMDGPU/AMDGPURewriteAGPRCopyMFMA.cpp (+4)
(modified) llvm/test/CodeGen/AMDGPU/rewrite-vgpr-mfma-scale-to-agpr.mir (+3-3)

diff --git a/llvm/lib/Target/AMDGPU/AMDGPURewriteAGPRCopyMFMA.cpp b/llvm/lib/Target/AMDGPU/AMDGPURewriteAGPRCopyMFMA.cpp
index 89c16dadb4b41..b5e3187289160 100644
--- a/llvm/lib/Target/AMDGPU/AMDGPURewriteAGPRCopyMFMA.cpp
+++ b/llvm/lib/Target/AMDGPU/AMDGPURewriteAGPRCopyMFMA.cpp
@@ -302,6 +302,10 @@ bool AMDGPURewriteAGPRCopyMFMAImpl::attemptReassignmentsToAGPR(
     const TargetRegisterClass *EquivalentAGPRRegClass =
         TRI.getEquivalentAGPRClass(MRI.getRegClass(InterferingReg));
 
+    // Do not reassign scale operands
+    if (EquivalentAGPRRegClass == &AMDGPU::AGPR_32RegClass)
+      return false;
+
     MCPhysReg Assignable = AMDGPU::NoRegister;
     if (EquivalentAGPRRegClass->contains(PrefPhysReg) &&
         LRM.checkInterference(ReassignLI, PrefPhysReg) ==
diff --git a/llvm/test/CodeGen/AMDGPU/rewrite-vgpr-mfma-scale-to-agpr.mir b/llvm/test/CodeGen/AMDGPU/rewrite-vgpr-mfma-scale-to-agpr.mir
index ab56c9982753f..12be806960b67 100644
--- a/llvm/test/CodeGen/AMDGPU/rewrite-vgpr-mfma-scale-to-agpr.mir
+++ b/llvm/test/CodeGen/AMDGPU/rewrite-vgpr-mfma-scale-to-agpr.mir
@@ -1,6 +1,6 @@
-# RUN: not llc -mtriple=amdgcn-amd-amdhsa -mcpu=gfx950 -run-pass=greedy,amdgpu-rewrite-agpr-copy-mfma -verify-machineinstrs -o - %s 2>&1 | FileCheck %s
-# CHECK: Illegal virtual register for instruction
-# CHECK: Expected a VGPR_32 register, but got a AGPR_32 register
+# RUN: llc -mtriple=amdgcn-amd-amdhsa -mcpu=gfx950 -run-pass=greedy,amdgpu-rewrite-agpr-copy-mfma -verify-machineinstrs -o - %s 2>&1 | FileCheck %s
+# CHECK-NOT: Illegal virtual register for instruction
+# CHECK-NOT: Expected a VGPR_32 register, but got a AGPR_32 register
  
 # Test for issue in amdgpu-rewrite-agpr-copy-mfma, which reassigns scale operand
 # in vgpr_32 register to agpr_32, not permitted by instruction format.

arsenm · 2025-11-20T22:58:49Z

llvm/test/CodeGen/AMDGPU/rewrite-vgpr-mfma-scale-to-agpr.mir

-# CHECK: Illegal virtual register for instruction
-# CHECK: Expected a VGPR_32 register, but got a AGPR_32 register
+# RUN: llc -mtriple=amdgcn-amd-amdhsa -mcpu=gfx950 -run-pass=greedy,amdgpu-rewrite-agpr-copy-mfma -verify-machineinstrs -o - %s 2>&1 | FileCheck %s
+# CHECK-NOT: Illegal virtual register for instruction


-NOT checks are close to useless, especially for checking error messages. Generate checks for the actual output

arsenm · 2025-11-20T23:00:17Z

llvm/lib/Target/AMDGPU/AMDGPURewriteAGPRCopyMFMA.cpp

        TRI.getEquivalentAGPRClass(MRI.getRegClass(InterferingReg));

+    // Do not reassign scale operands
+    if (EquivalentAGPRRegClass == &AMDGPU::AGPR_32RegClass)


This doesn't seem like the right condition. It just happens the scale operands are the only 32-bit input case. This should more directly check the operand constraint instead of checking a hardcodes class equality

github-actions · 2025-11-20T23:13:13Z

🐧 Linux x64 Test Results

186858 tests passed
4906 tests skipped

✅ The build succeeded and all tests passed.

ronlieb

LGTM, need @arsenm to do final approve

arsenm · 2025-12-01T23:54:14Z

llvm/lib/Target/AMDGPU/AMDGPURewriteAGPRCopyMFMA.cpp

      if (isRewriteCandidate(*MI)) {
+
+        int AGPROp = AMDGPU::getMFMASrcCVDstAGPROp(MI->getOpcode());
+        MachineInstrBuilder TmpMIB =


Definitely should not be creating temporary instructions

arsenm · 2025-12-01T23:55:05Z

llvm/lib/Target/AMDGPU/AMDGPURewriteAGPRCopyMFMA.cpp

+        unsigned OpNo = &MO - &MI->getOperand(0);
+        const TargetRegisterClass *EquivalentAGPRRegClass =
+            TRI.getEquivalentAGPRClass(MRI.getRegClass(Reg));
+        const TargetRegisterClass *Allowed = TmpMI->getRegClassConstraintEffect(


You want TargetInstrInfo::getRegClass to get the static constraint of the known operand (alternatively, you could check that the use is one of the known src0/src1 operands and not the _scale name)

llvm/lib/Target/AMDGPU/AMDGPURewriteAGPRCopyMFMA.cpp

Co-authored-by: Matt Arsenault <[email protected]>

llvm/test/CodeGen/AMDGPU/rewrite-vgpr-mfma-scale-to-agpr.mir

Co-authored-by: Matt Arsenault <[email protected]>

github-actions · 2025-12-02T17:29:30Z

✅ With the latest revision this PR passed the C/C++ code formatter.

In MFMA rewrite pass, prevent AGPR_32 reg class assignment for scale operands, not permitted by instruction format. --------- Co-authored-by: Matt Arsenault <[email protected]>

[AMDGPU] Fix AGPR_32 reg assign for mfma scale ops

1c2072f

In MFMA rewrite pass, prevent AGPR_32 reg class assignment for scale operands, not permitted by instruction format.

llvmbot added the backend:AMDGPU label Nov 20, 2025

ronlieb requested review from arsenm and ronlieb November 20, 2025 22:30

arsenm reviewed Nov 20, 2025

View reviewed changes

hjagasiaAMD added 2 commits December 1, 2025 09:06

Check operand constraints and update mir checks.

d9405f2

Merge branch 'main' into agpr-copy-mfma

92cc4c8

hjagasiaAMD requested a review from arsenm December 1, 2025 17:36

ronlieb approved these changes Dec 1, 2025

View reviewed changes

arsenm reviewed Dec 1, 2025

View reviewed changes

Get the static constraint of the known operand.

08fc310

hjagasiaAMD requested a review from arsenm December 2, 2025 16:27

arsenm reviewed Dec 2, 2025

View reviewed changes

llvm/lib/Target/AMDGPU/AMDGPURewriteAGPRCopyMFMA.cpp Outdated Show resolved Hide resolved

llvm/lib/Target/AMDGPU/AMDGPURewriteAGPRCopyMFMA.cpp Outdated Show resolved Hide resolved

hjagasiaAMD and others added 2 commits December 2, 2025 11:25

Update llvm/lib/Target/AMDGPU/AMDGPURewriteAGPRCopyMFMA.cpp

aff0f88

Co-authored-by: Matt Arsenault <[email protected]>

Update llvm/lib/Target/AMDGPU/AMDGPURewriteAGPRCopyMFMA.cpp

1fc74ea

Co-authored-by: Matt Arsenault <[email protected]>

arsenm reviewed Dec 2, 2025

View reviewed changes

llvm/test/CodeGen/AMDGPU/rewrite-vgpr-mfma-scale-to-agpr.mir Outdated Show resolved Hide resolved

Update llvm/test/CodeGen/AMDGPU/rewrite-vgpr-mfma-scale-to-agpr.mir

ef61605

Co-authored-by: Matt Arsenault <[email protected]>

Format

a55eace

arsenm approved these changes Dec 2, 2025

View reviewed changes

ronlieb merged commit 2183846 into llvm:main Dec 2, 2025
10 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[AMDGPU] Fix AGPR_32 reg assign for mfma scale ops #168964

[AMDGPU] Fix AGPR_32 reg assign for mfma scale ops #168964

Uh oh!

hjagasiaAMD commented Nov 20, 2025

Uh oh!

llvmbot commented Nov 20, 2025

Uh oh!

arsenm Nov 20, 2025

Uh oh!

arsenm Nov 20, 2025

Uh oh!

github-actions bot commented Nov 20, 2025 •

edited

Loading

Uh oh!

ronlieb left a comment

Uh oh!

arsenm Dec 1, 2025

Uh oh!

arsenm Dec 1, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions bot commented Dec 2, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[AMDGPU] Fix AGPR_32 reg assign for mfma scale ops #168964

[AMDGPU] Fix AGPR_32 reg assign for mfma scale ops #168964

Uh oh!

Conversation

hjagasiaAMD commented Nov 20, 2025

Uh oh!

llvmbot commented Nov 20, 2025

Uh oh!

arsenm Nov 20, 2025

Choose a reason for hiding this comment

Uh oh!

arsenm Nov 20, 2025

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Nov 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🐧 Linux x64 Test Results

Uh oh!

ronlieb left a comment

Choose a reason for hiding this comment

Uh oh!

arsenm Dec 1, 2025

Choose a reason for hiding this comment

Uh oh!

arsenm Dec 1, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions bot commented Dec 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

github-actions bot commented Nov 20, 2025 •

edited

Loading

github-actions bot commented Dec 2, 2025 •

edited

Loading