[SROA] Add Stored Value Size Check for Tree-Structured Merge #162921

Chengjunp · 2025-10-10T21:11:56Z

The change fixes a bug in the SROA where tree-structured merge optimization was incorrectly applied when the size of the stored value was not a multiple of the new allocated element type size. The original change is #152793. A simple repro would be

define <1 x i32> @foo(<1 x i16> %a, <1 x i16> %b) {
entry:
  %alloca = alloca [1 x i32]

  %ptr0 = getelementptr inbounds [2 x i16], ptr %alloca, i32 0, i32 0
  store <1 x i16> %a, ptr %ptr0

  %ptr1 = getelementptr inbounds [2 x i16], ptr %alloca, i32 0, i32 1
  store <1 x i16> %b, ptr %ptr1

  %result = load <1 x i32>, ptr %alloca
  ret <1 x i32> %result
}

Currently, this will lead to a compile time crash.

In this change, we will skip the tree-structured merge for this case and fall back to normal SROA.

llvmbot · 2025-10-10T21:12:33Z

@llvm/pr-subscribers-llvm-transforms

Author: Chengjun (Chengjunp)

Changes

The change fixes a bug in the SROA where tree-structured merge optimization was incorrectly applied when the size of the stored value was not a multiple of the new allocated element type size. A simple repro would be

define &lt;1 x i32&gt; @<!-- -->foo(&lt;1 x i16&gt; %a, &lt;1 x i16&gt; %b) {
entry:
  %alloca = alloca [1 x i32]

  %ptr0 = getelementptr inbounds [2 x i16], ptr %alloca, i32 0, i32 0
  store &lt;1 x i16&gt; %a, ptr %ptr0

  %ptr1 = getelementptr inbounds [2 x i16], ptr %alloca, i32 0, i32 1
  store &lt;1 x i16&gt; %b, ptr %ptr1

  %result = load &lt;1 x i32&gt;, ptr %alloca
  ret &lt;1 x i32&gt; %result
}

Currently, this will lead to a compile time crash.

In this change, we will skip the tree-structured merge for this case and fall back to normal SROA.

Full diff: https://github.com/llvm/llvm-project/pull/162921.diff

2 Files Affected:

(modified) llvm/lib/Transforms/Scalar/SROA.cpp (+8)
(modified) llvm/test/Transforms/SROA/vector-promotion-cannot-tree-structure-merge.ll (+14)

diff --git a/llvm/lib/Transforms/Scalar/SROA.cpp b/llvm/lib/Transforms/Scalar/SROA.cpp
index 45d3d493a9e68..dc24adab42a3b 100644
--- a/llvm/lib/Transforms/Scalar/SROA.cpp
+++ b/llvm/lib/Transforms/Scalar/SROA.cpp
@@ -2961,6 +2961,7 @@ class AllocaSliceRewriter : public InstVisitor<AllocaSliceRewriter, bool> {
         isa<FixedVectorType>(NewAI.getAllocatedType())
             ? cast<FixedVectorType>(NewAI.getAllocatedType())->getElementType()
             : Type::getInt8Ty(NewAI.getContext());
+    unsigned AllocatedEltTySize = DL.getTypeSizeInBits(AllocatedEltTy);
 
     // Helper to check if a type is
     //  1. A fixed vector type
@@ -2991,9 +2992,16 @@ class AllocaSliceRewriter : public InstVisitor<AllocaSliceRewriter, bool> {
         // Do not handle the case if
         //   1. The store does not meet the conditions in the helper function
         //   2. The store is volatile
+        //   3. The store value type size is less than the allocated element
+        //   type size
         if (!IsTypeValidForTreeStructuredMerge(
                 SI->getValueOperand()->getType()) ||
             SI->isVolatile())
+        return std::nullopt;
+        auto *VecTy = cast<FixedVectorType>(SI->getValueOperand()->getType());
+        unsigned NumElts = VecTy->getNumElements();
+        unsigned EltSize = DL.getTypeSizeInBits(VecTy->getElementType());
+        if (NumElts * EltSize % AllocatedEltTySize != 0)
           return std::nullopt;
         StoreInfos.emplace_back(SI, S.beginOffset(), S.endOffset(),
                                 SI->getValueOperand());
diff --git a/llvm/test/Transforms/SROA/vector-promotion-cannot-tree-structure-merge.ll b/llvm/test/Transforms/SROA/vector-promotion-cannot-tree-structure-merge.ll
index c858d071451e8..ead6e027ed37c 100644
--- a/llvm/test/Transforms/SROA/vector-promotion-cannot-tree-structure-merge.ll
+++ b/llvm/test/Transforms/SROA/vector-promotion-cannot-tree-structure-merge.ll
@@ -219,4 +219,18 @@ entry:
 
 }
 
+define <1 x i32> @test_store_value_size_not_multiple_of_allocated_element_type_size(<1 x i16> %a, <1 x i16> %b) {
+entry:
+  %alloca = alloca [2 x i16]
+
+  %ptr0 = getelementptr inbounds [2 x i16], ptr %alloca, i32 0, i32 0
+  store <1 x i16> %a, ptr %ptr0
+
+  %ptr1 = getelementptr inbounds [2 x i16], ptr %alloca, i32 0, i32 1
+  store <1 x i16> %b, ptr %ptr1
+
+  %result = load <1 x i32>, ptr %alloca
+  ret <1 x i32> %result
+}
+
 declare void @llvm.memset.p0.i64(ptr nocapture writeonly, i8, i64, i1 immarg)

github-actions · 2025-10-10T21:15:33Z

✅ With the latest revision this PR passed the C/C++ code formatter.

mjulian31

LGTM

llvm/lib/Transforms/Scalar/SROA.cpp

…/llvm-project into chengjunp/fix_sroa_bug

@foo

…2921) The change fixes a bug in the SROA where tree-structured merge optimization was incorrectly applied when the size of the stored value was not a multiple of the new allocated element type size. The original change is llvm#152793. A simple repro would be ``` define <1 x i32> @foo(<1 x i16> %a, <1 x i16> %b) { entry: %alloca = alloca [1 x i32] %ptr0 = getelementptr inbounds [2 x i16], ptr %alloca, i32 0, i32 0 store <1 x i16> %a, ptr %ptr0 %ptr1 = getelementptr inbounds [2 x i16], ptr %alloca, i32 0, i32 1 store <1 x i16> %b, ptr %ptr1 %result = load <1 x i32>, ptr %alloca ret <1 x i32> %result } ``` Currently, this will lead to a compile time crash. In this change, we will skip the tree-structured merge for this case and fall back to normal SROA.

@foo

…2921) The change fixes a bug in the SROA where tree-structured merge optimization was incorrectly applied when the size of the stored value was not a multiple of the new allocated element type size. The original change is llvm#152793. A simple repro would be ``` define <1 x i32> @foo(<1 x i16> %a, <1 x i16> %b) { entry: %alloca = alloca [1 x i32] %ptr0 = getelementptr inbounds [2 x i16], ptr %alloca, i32 0, i32 0 store <1 x i16> %a, ptr %ptr0 %ptr1 = getelementptr inbounds [2 x i16], ptr %alloca, i32 0, i32 1 store <1 x i16> %b, ptr %ptr1 %result = load <1 x i32>, ptr %alloca ret <1 x i32> %result } ``` Currently, this will lead to a compile time crash. In this change, we will skip the tree-structured merge for this case and fall back to normal SROA.

Fix SROA issue

db3efa1

Chengjunp requested a review from Prince781 October 10, 2025 21:12

Chengjunp self-assigned this Oct 10, 2025

llvmbot added the llvm:transforms label Oct 10, 2025

Chengjunp mentioned this pull request Oct 10, 2025

[SROA] Use tree-structure merge to remove alloca #152793

Merged

Format

3c2410e

Chengjunp requested a review from mjulian31 October 10, 2025 21:16

Merge branch 'main' into chengjunp/fix_sroa_bug

cb2a28a

mjulian31 approved these changes Oct 10, 2025

View reviewed changes

Prince781 approved these changes Oct 10, 2025

View reviewed changes

llvm/lib/Transforms/Scalar/SROA.cpp Outdated Show resolved Hide resolved

Chengjunp added 2 commits October 10, 2025 23:51

Update comments

01510ed

Merge branch 'chengjunp/fix_sroa_bug' of https://github.com/Chengjunp…

f0d90fc

…/llvm-project into chengjunp/fix_sroa_bug

Chengjunp enabled auto-merge (squash) October 10, 2025 23:53

Chengjunp merged commit 8faeed0 into llvm:main Oct 11, 2025
7 of 9 checks passed

This was referenced Oct 17, 2025

[EXTERNAL][SROA] Add Stored Value Size Check for Tree-Structured Merge ROCm/rocMLIR#2041

Merged

[7.1][EXTERNAL][SROA] Add Stored Value Size Check for Tree-Structured Merge ROCm/rocMLIR#2044

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SROA] Add Stored Value Size Check for Tree-Structured Merge #162921

[SROA] Add Stored Value Size Check for Tree-Structured Merge #162921

Uh oh!

Chengjunp commented Oct 10, 2025 •

edited

Loading

Uh oh!

llvmbot commented Oct 10, 2025

Uh oh!

github-actions bot commented Oct 10, 2025 •

edited

Loading

Uh oh!

mjulian31 left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[SROA] Add Stored Value Size Check for Tree-Structured Merge #162921

[SROA] Add Stored Value Size Check for Tree-Structured Merge #162921

Uh oh!

Conversation

Chengjunp commented Oct 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

llvmbot commented Oct 10, 2025

Uh oh!

github-actions bot commented Oct 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mjulian31 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Chengjunp commented Oct 10, 2025 •

edited

Loading

github-actions bot commented Oct 10, 2025 •

edited

Loading