[MLIR] [Vector] Added canonicalizer for folding from_elements + transpose #161841

keshavvinayak01 · 2025-10-03T13:01:52Z

Description

Adds a new canonicalization pattern that folds vector.transpose(vector.from_elements(...)) => vector.from_elements. The pattern reorders the operands of vector.from_elements and sets the result type to the transposed shape, which reproduces the effect of the transpose op and makes it unnecessary.
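
As an illustration of the reordering described above, here is a minimal standalone sketch in plain C++ with no MLIR dependencies. The names `permuteOperandOrder` and `rowMajorStrides` are hypothetical and only mirror the idea: for every destination position in row-major order, find the source position the transpose would read from. The pattern in this PR uses MLIR's `computeStrides`, `delinearize`, and `linearize` utilities instead.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Row-major strides for a shape, e.g. {2, 3} -> {3, 1}.
// Hypothetical helper, standing in for MLIR's computeStrides.
static std::vector<int64_t> rowMajorStrides(const std::vector<int64_t> &shape) {
  std::vector<int64_t> strides(shape.size(), 1);
  for (int64_t i = static_cast<int64_t>(shape.size()) - 2; i >= 0; --i)
    strides[i] = strides[i + 1] * shape[i + 1];
  return strides;
}

// Returns, for each destination linear index, the source linear index whose
// element lands there after transposing `srcShape` with `perm`.
// Assumed convention (as in vector.transpose): dstShape[i] = srcShape[perm[i]].
std::vector<int64_t> permuteOperandOrder(const std::vector<int64_t> &srcShape,
                                         const std::vector<int64_t> &perm) {
  const size_t rank = srcShape.size();
  std::vector<int64_t> dstShape(rank);
  int64_t numElements = 1;
  for (size_t i = 0; i < rank; ++i) {
    dstShape[i] = srcShape[perm[i]];
    numElements *= srcShape[i];
  }
  const std::vector<int64_t> srcStrides = rowMajorStrides(srcShape);
  const std::vector<int64_t> dstStrides = rowMajorStrides(dstShape);

  std::vector<int64_t> order;
  order.reserve(numElements);
  for (int64_t linearIdx = 0; linearIdx < numElements; ++linearIdx) {
    int64_t remainder = linearIdx;
    int64_t srcLinearIdx = 0;
    for (size_t i = 0; i < rank; ++i) {
      // Destination coordinate along dimension i.
      int64_t dstCoord = remainder / dstStrides[i];
      remainder %= dstStrides[i];
      // Destination dimension i reads from source dimension perm[i].
      srcLinearIdx += dstCoord * srcStrides[perm[i]];
    }
    order.push_back(srcLinearIdx);
  }
  return order;
}
```

For the 2x3 input with permutation [1, 0] used in the lit test, this sketch yields the order 0, 3, 1, 4, 2, 5, which corresponds to the %arg0, %arg3, %arg1, %arg4, %arg2, %arg5 operand order in the CHECK line.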

Testing

Added a 2D vector lit test that verifies the working of the rewrite.

llvmbot · 2025-10-03T13:02:31Z

@llvm/pr-subscribers-mlir

@llvm/pr-subscribers-mlir-vector

Author: Keshav Vinayak Jha (keshavvinayak01)

Changes

Description

Adds a new canonicalization pattern that folds vector.transpose(vector.from_elements(...)) => vector.from_elements. The pattern reorders the operands of vector.from_elements and sets the result type to the transposed shape, which reproduces the effect of the transpose op and makes it unnecessary.

Testing

Added a 2D vector lit test that verifies the working of the rewrite.

Full diff: https://github.com/llvm/llvm-project/pull/161841.diff

2 Files Affected:

(modified) mlir/lib/Dialect/Vector/IR/VectorOps.cpp (+57-1)
(modified) mlir/test/Dialect/Vector/canonicalize.mlir (+12)

diff --git a/mlir/lib/Dialect/Vector/IR/VectorOps.cpp b/mlir/lib/Dialect/Vector/IR/VectorOps.cpp
index b0132e889302f..7f6313c11ea18 100644
--- a/mlir/lib/Dialect/Vector/IR/VectorOps.cpp
+++ b/mlir/lib/Dialect/Vector/IR/VectorOps.cpp
@@ -6723,6 +6723,61 @@ class FoldTransposeShapeCast final : public OpRewritePattern<TransposeOp> {
   }
 };
 
+/// Folds transpose(from_elements(...)) into a new from_elements with permuted
+/// operands matching the transposed shape.
+class FoldTransposeFromElements final : public OpRewritePattern<TransposeOp> {
+public:
+  using Base::Base;
+  LogicalResult matchAndRewrite(vector::TransposeOp transposeOp,
+                                PatternRewriter &rewriter) const override {
+    auto fromElementsOp =
+        transposeOp.getVector().getDefiningOp<vector::FromElementsOp>();
+    if (!fromElementsOp)
+      return failure();
+
+    VectorType srcTy = fromElementsOp.getDest().getType();
+    VectorType dstTy = transposeOp.getType();
+
+    ArrayRef<int64_t> permutation = transposeOp.getPermutation();
+    int64_t rank = srcTy.getRank();
+
+    // Build inverse permutation to map destination indices back to source.
+    SmallVector<int64_t, 4> inversePerm(rank, 0);
+    for (int64_t i = 0; i < rank; ++i)
+      inversePerm[permutation[i]] = i;
+
+    ArrayRef<int64_t> srcShape = srcTy.getShape();
+    ArrayRef<int64_t> dstShape = dstTy.getShape();
+    SmallVector<int64_t, 4> srcIdx(rank, 0);
+    SmallVector<int64_t, 4> dstIdx(rank, 0);
+    SmallVector<int64_t, 4> srcStrides = computeStrides(srcShape);
+    SmallVector<int64_t, 4> dstStrides = computeStrides(dstShape);
+
+    auto elements = fromElementsOp.getElements();
+    SmallVector<Value> newElements;
+    int64_t dstNumElements = dstTy.getNumElements();
+    newElements.reserve(dstNumElements);
+
+    // For each element in destination row-major order, pick the corresponding
+    // source element.
+    for (int64_t lin = 0; lin < dstNumElements; ++lin) {
+      // Pick the destination element index.
+      dstIdx = delinearize(lin, dstStrides);
+      // Map the destination element index to the source element index.
+      for (int64_t j = 0; j < rank; ++j)
+        srcIdx[j] = dstIdx[inversePerm[j]];
+      // Linearize the source element index.
+      int64_t srcLin = linearize(srcIdx, srcStrides);
+      // Add the source element to the new elements.
+      newElements.push_back(elements[srcLin]);
+    }
+
+    rewriter.replaceOpWithNewOp<FromElementsOp>(transposeOp, dstTy,
+                                                newElements);
+    return success();
+  }
+};
+
 /// Folds transpose(broadcast(x)) to broadcast(x) if the transpose is
 /// 'order preserving', where 'order preserving' means the flattened
 /// inputs and outputs of the transpose have identical (numerical) values.
@@ -6823,7 +6878,8 @@ class FoldTransposeBroadcast : public OpRewritePattern<vector::TransposeOp> {
 void vector::TransposeOp::getCanonicalizationPatterns(
     RewritePatternSet &results, MLIRContext *context) {
   results.add<FoldTransposeCreateMask, FoldTransposeShapeCast, TransposeFolder,
-              FoldTransposeSplat, FoldTransposeBroadcast>(context);
+              FoldTransposeSplat, FoldTransposeFromElements,
+              FoldTransposeBroadcast>(context);
 }
 
 //===----------------------------------------------------------------------===//
diff --git a/mlir/test/Dialect/Vector/canonicalize.mlir b/mlir/test/Dialect/Vector/canonicalize.mlir
index 5448976f84760..5f34d144cd472 100644
--- a/mlir/test/Dialect/Vector/canonicalize.mlir
+++ b/mlir/test/Dialect/Vector/canonicalize.mlir
@@ -308,6 +308,18 @@ func.func @constant_mask_transpose_to_transposed_constant_mask() -> (vector<2x3x
 
 // -----
 
+// CHECK-LABEL: transpose_from_elements_2d
+func.func @transpose_from_elements_2d(%a0: i32, %a1: i32, %a2: i32,
+                                      %a3: i32, %a4: i32, %a5: i32) -> vector<3x2xi32> {
+  %v = vector.from_elements %a0, %a1, %a2, %a3, %a4, %a5 : vector<2x3xi32>
+  %t = vector.transpose %v, [1, 0] : vector<2x3xi32> to vector<3x2xi32>
+  return %t : vector<3x2xi32>
+  // CHECK: %[[R:.*]] = vector.from_elements %arg0, %arg3, %arg1, %arg4, %arg2, %arg5 : vector<3x2xi32>
+  // CHECK-NOT: vector.transpose
+}
+
+// -----
+
 func.func @extract_strided_slice_of_constant_mask() -> (vector<2x2xi1>) {
   %0 = vector.constant_mask [2, 2] : vector<4x3xi1>
   %1 = vector.extract_strided_slice %0

banach-space

Thanks!

banach-space · 2025-10-13T16:08:01Z

mlir/lib/Dialect/Vector/IR/VectorOps.cpp

+
+    // For each element in destination row-major order, pick the corresponding
+    // source element.
+    for (int64_t lin = 0; lin < dstNumElements; ++lin) {


What does lin represent?

So "lin" is short for "linear index" - it's a 1D index that represents the position of an element when the multi-dimensional vector is laid out in row-major order in memory. I felt it was a good iter name.
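
To make the term concrete, here is a small standalone C++ sketch of row-major linearization and its inverse. The helper names `toLinearIdx` and `toCoords` are hypothetical and used only for illustration; the patch itself relies on MLIR's `linearize` and `delinearize` utilities.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Maps multi-dimensional coordinates to a row-major linear index.
// For shape {2, 3}, coordinates (1, 0) map to 1 * 3 + 0 = 3.
int64_t toLinearIdx(const std::vector<int64_t> &coords,
                    const std::vector<int64_t> &shape) {
  int64_t linearIdx = 0;
  for (size_t i = 0; i < shape.size(); ++i)
    linearIdx = linearIdx * shape[i] + coords[i];
  return linearIdx;
}

// Inverse of toLinearIdx: recovers the coordinates from a linear index by
// peeling off the fastest-varying (last) dimension first.
std::vector<int64_t> toCoords(int64_t linearIdx,
                              const std::vector<int64_t> &shape) {
  std::vector<int64_t> coords(shape.size(), 0);
  for (size_t i = shape.size(); i-- > 0;) {
    coords[i] = linearIdx % shape[i];
    linearIdx /= shape[i];
  }
  return coords;
}
```

The loop variable in the pattern plays the role of `linearIdx` here: it walks the destination vector in row-major order, and each value is converted to coordinates before being mapped back to the source.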

banach-space · 2025-10-16T13:46:06Z

mlir/lib/Dialect/Vector/IR/VectorOps.cpp

+
+    // For each element in destination row-major order, pick the corresponding
+    // source element.
+    for (int64_t lin = 0; lin < dstNumElements; ++lin) {


I find lin a bit too enigmatic. Why not linearIdx?

banach-space · 2025-10-17T08:23:45Z

mlir/test/Dialect/Vector/canonicalize.mlir

+// CHECK-LABEL: transpose_from_elements_1d
+func.func @transpose_from_elements_1d(%el_0: i32, %el_1: i32) -> vector<2xi32> {
+  %v = vector.from_elements %el_0, %el_1 : vector<2xi32>
+  %t = vector.transpose %v, [0] : vector<2xi32> to vector<2xi32>
+  return %t : vector<2xi32>
+  // CHECK: %[[R:.*]] = vector.from_elements %[[EL_0:.*]], %[[EL_1:.*]] : vector<2xi32>
+  // CHECK-NOT: vector.transpose
+}


Variable R is defined, but never used. Please fix by adding return %[[R]].

Variables EL_0, EL_1 should be defined near function signature and then re-used here.

Specifically:

Suggested change

-// CHECK-LABEL: transpose_from_elements_1d
-func.func @transpose_from_elements_1d(%el_0: i32, %el_1: i32) -> vector<2xi32> {
-  %v = vector.from_elements %el_0, %el_1 : vector<2xi32>
-  %t = vector.transpose %v, [0] : vector<2xi32> to vector<2xi32>
-  return %t : vector<2xi32>
-  // CHECK: %[[R:.*]] = vector.from_elements %[[EL_0:.*]], %[[EL_1:.*]] : vector<2xi32>
-  // CHECK-NOT: vector.transpose
-}
+// CHECK-LABEL: transpose_from_elements_1d
+// CHECK-SAME: %[[EL_0:.*]]: i32, %[[EL_1:.*]]: i32
+func.func @transpose_from_elements_1d(%el_0: i32, %el_1: i32) -> vector<2xi32> {
+  %v = vector.from_elements %el_0, %el_1 : vector<2xi32>
+  %t = vector.transpose %v, [0] : vector<2xi32> to vector<2xi32>
+  return %t : vector<2xi32>
+  // CHECK: %[[R:.*]] = vector.from_elements %[[EL_0]], %[[EL_1]] : vector<2xi32>
+  // CHECK-NOT: vector.transpose
+  // CHECK: return %[[R]]
+}

Similar comment for other tests. For more details, see e.g. https://llvm.org/docs/CommandGuide/FileCheck.html#filecheck-string-substitution-blocks

banach-space · 2025-10-17T08:28:22Z

mlir/test/Dialect/Vector/canonicalize.mlir

Please move the newly added tests near other tests for vector.from_elements, e.g. https://github.com/llvm/llvm-project/blob/main/mlir/test/Dialect/Vector/canonicalize.mlir

Also, please add block comments documenting what folder is tested. Examples:

llvm-project/mlir/test/Dialect/Vector/canonicalize.mlir, lines 3365 to 3367 in b228a18:

// +---------------------------------------------------------------------------
// Tests for foldFromElementsToConstant
// +---------------------------------------------------------------------------

llvm-project/mlir/test/Dialect/Vector/canonicalize.mlir, lines 3527 to 3529 in b228a18:

// +---------------------------------------------------------------------------
// End of Tests for foldFromElementsToConstant
// +---------------------------------------------------------------------------

dcaballe · 2025-10-17T22:24:45Z

mlir/lib/Dialect/Vector/IR/VectorOps.cpp

 };

+/// Folds transpose(from_elements(...)) into a new from_elements with permuted
+/// operands matching the transposed shape.


Could you add a before and after IR example? That usually helps a lot with understanding.

dcaballe · 2025-10-17T22:29:28Z

LGTM, thanks! (modulo pending comments)

keshavvinayak01 added 2 commits October 3, 2025 05:52

Added canonicalization (vector.from_elements + vector.transpose -> ve…

c94bbb7

…ctor.transpose) Signed-off-by: Keshav Vinayak Jha <[email protected]>

Formatted

6bef6d2

Signed-off-by: Keshav Vinayak Jha <[email protected]>

keshavvinayak01 requested review from Groverkss, banach-space, dcaballe and nicolasvasilache as code owners October 3, 2025 13:01

llvmbot added mlir:vectorops mlir mlir:vector labels Oct 3, 2025

keshavvinayak01 mentioned this pull request Oct 3, 2025

Request Commit Access For keshavvinayak01 #161149

Open

keshavvinayak01 changed the title ~~[Vector] Added canonicalizer for folding from_elements + transpose~~ [MLIR] [Vector] Added canonicalizer for folding from_elements + transpose Oct 3, 2025

banach-space reviewed Oct 13, 2025

Addressed comments:

70d3d8f

1. Minor nitpicks in code formatting. 2. More lit tests, covering 1D, 2D, 3D cases. Signed-off-by: Keshav Vinayak Jha <[email protected]>

keshavvinayak01 requested a review from banach-space October 16, 2025 11:00

banach-space reviewed Oct 16, 2025

Explainable arg names in lit test

617267b

Signed-off-by: Keshav Vinayak Jha <[email protected]>

keshavvinayak01 requested a review from banach-space October 16, 2025 15:08

banach-space reviewed Oct 17, 2025

Addressed Comments:

2889f3d

1. Renamed the loop iterator to linearIdx. 2. Moved canonicalizer lit tests next to the other vector.from_elements tests. 3. Added block comments marking the beginning, end, and name of the pattern. Signed-off-by: Keshav Vinayak Jha <[email protected]>

keshavvinayak01 requested a review from banach-space October 17, 2025 13:14

dcaballe approved these changes Oct 17, 2025
