[mlir][linalg] Fix neutral elt for softmax #118952

oowekyala · 2024-12-06T10:32:51Z

The decomposition of linalg.softmax uses maxnumf, but the identity element that is used in the generated code is the one for maximumf. They are not the same, as the identity for maxnumf is NaN, while the one of maximumf is -Infty. This is wrong and prevents the maxnumf from being folded.

Related to #114595, which fixed the folder for maxnumf.

llvmbot · 2024-12-06T10:33:24Z

@llvm/pr-subscribers-mlir

Author: Clément Fournier (oowekyala)

Changes

The decomposition of linalg.softmax uses maxnumf, but the identity element that is used in the generated code is the one for maximumf. They are not the same, as the identity for maxnumf is NaN, while the one of maximumf is -Infty. This is wrong and prevents the maxnumf from being folded.

Related to #114595, which fixed the folder for maxnumf.

Full diff: https://github.com/llvm/llvm-project/pull/118952.diff

2 Files Affected:

(modified) mlir/lib/Dialect/Linalg/IR/LinalgOps.cpp (+1-1)
(modified) mlir/test/Dialect/Linalg/transform-op-decompose.mlir (+1-1)

diff --git a/mlir/lib/Dialect/Linalg/IR/LinalgOps.cpp b/mlir/lib/Dialect/Linalg/IR/LinalgOps.cpp
index d9840e3923c4f7..133855cc389338 100644
--- a/mlir/lib/Dialect/Linalg/IR/LinalgOps.cpp
+++ b/mlir/lib/Dialect/Linalg/IR/LinalgOps.cpp
@@ -2892,7 +2892,7 @@ FailureOr<SmallVector<Value>> SoftmaxOp::decomposeOperation(OpBuilder &b) {
   dims.erase(dims.begin() + reductionDim);
   // Step 1: Compute max along dim.
   Value outputReduce = b.create<tensor::EmptyOp>(loc, dims, elementType);
-  Value neutralForMaxF = arith::getIdentityValue(arith::AtomicRMWKind::maximumf,
+  Value neutralForMaxF = arith::getIdentityValue(arith::AtomicRMWKind::maxnumf,
                                                  elementType, b, loc,
                                                  /*useOnlyFiniteValue=*/true);
   Value neutralForMaxFInit =
diff --git a/mlir/test/Dialect/Linalg/transform-op-decompose.mlir b/mlir/test/Dialect/Linalg/transform-op-decompose.mlir
index 2e211d2fa7dbe9..72acf43361f501 100644
--- a/mlir/test/Dialect/Linalg/transform-op-decompose.mlir
+++ b/mlir/test/Dialect/Linalg/transform-op-decompose.mlir
@@ -210,7 +210,7 @@ func.func @softmax(%arg0: tensor<2x16x32xf32>, %dst: tensor<2x16x32xf32>) -> ten
 // CHECK-LABEL:      func.func @softmax(
 // CHECK-SAME:           %[[ARG0:[a-zA-Z0-9_]+]]: tensor<2x16x32xf32>, %[[DST:[a-zA-Z0-9_]+]]: tensor<2x16x32xf32>) -> tensor<2x16x32xf32> {
 // CHECK-DAG:        %[[D1:.+]] = tensor.empty() : tensor<2x16xf32>
-// CHECK-DAG:        %[[CST:.+]] = arith.constant -3.40282347E+38 : f32
+// CHECK-DAG:        %[[CST:.+]] = arith.constant 0xFFC00000 : f32
 // CHECK:        %[[D2:.+]] = linalg.fill ins(%[[CST]] : f32) outs(%[[D1]] : tensor<2x16xf32>) -> tensor<2x16xf32>
 // CHECK:        %[[D3:.+]] = linalg.generic {indexing_maps = [#[[$MAP]], #[[$MAP1]]], iterator_types = ["parallel",
 // CHECK-SAME:     "parallel", "reduction"]} ins(%[[ARG0]] : tensor<2x16x32xf32>) outs(%[[D2]] : tensor<2x16xf32>) {

llvmbot · 2024-12-06T10:33:24Z

@llvm/pr-subscribers-mlir-linalg

Author: Clément Fournier (oowekyala)

Changes

The decomposition of linalg.softmax uses maxnumf, but the identity element that is used in the generated code is the one for maximumf. They are not the same, as the identity for maxnumf is NaN, while the one of maximumf is -Infty. This is wrong and prevents the maxnumf from being folded.

Related to #114595, which fixed the folder for maxnumf.

Full diff: https://github.com/llvm/llvm-project/pull/118952.diff

2 Files Affected:

(modified) mlir/lib/Dialect/Linalg/IR/LinalgOps.cpp (+1-1)
(modified) mlir/test/Dialect/Linalg/transform-op-decompose.mlir (+1-1)

diff --git a/mlir/lib/Dialect/Linalg/IR/LinalgOps.cpp b/mlir/lib/Dialect/Linalg/IR/LinalgOps.cpp
index d9840e3923c4f7..133855cc389338 100644
--- a/mlir/lib/Dialect/Linalg/IR/LinalgOps.cpp
+++ b/mlir/lib/Dialect/Linalg/IR/LinalgOps.cpp
@@ -2892,7 +2892,7 @@ FailureOr<SmallVector<Value>> SoftmaxOp::decomposeOperation(OpBuilder &b) {
   dims.erase(dims.begin() + reductionDim);
   // Step 1: Compute max along dim.
   Value outputReduce = b.create<tensor::EmptyOp>(loc, dims, elementType);
-  Value neutralForMaxF = arith::getIdentityValue(arith::AtomicRMWKind::maximumf,
+  Value neutralForMaxF = arith::getIdentityValue(arith::AtomicRMWKind::maxnumf,
                                                  elementType, b, loc,
                                                  /*useOnlyFiniteValue=*/true);
   Value neutralForMaxFInit =
diff --git a/mlir/test/Dialect/Linalg/transform-op-decompose.mlir b/mlir/test/Dialect/Linalg/transform-op-decompose.mlir
index 2e211d2fa7dbe9..72acf43361f501 100644
--- a/mlir/test/Dialect/Linalg/transform-op-decompose.mlir
+++ b/mlir/test/Dialect/Linalg/transform-op-decompose.mlir
@@ -210,7 +210,7 @@ func.func @softmax(%arg0: tensor<2x16x32xf32>, %dst: tensor<2x16x32xf32>) -> ten
 // CHECK-LABEL:      func.func @softmax(
 // CHECK-SAME:           %[[ARG0:[a-zA-Z0-9_]+]]: tensor<2x16x32xf32>, %[[DST:[a-zA-Z0-9_]+]]: tensor<2x16x32xf32>) -> tensor<2x16x32xf32> {
 // CHECK-DAG:        %[[D1:.+]] = tensor.empty() : tensor<2x16xf32>
-// CHECK-DAG:        %[[CST:.+]] = arith.constant -3.40282347E+38 : f32
+// CHECK-DAG:        %[[CST:.+]] = arith.constant 0xFFC00000 : f32
 // CHECK:        %[[D2:.+]] = linalg.fill ins(%[[CST]] : f32) outs(%[[D1]] : tensor<2x16xf32>) -> tensor<2x16xf32>
 // CHECK:        %[[D3:.+]] = linalg.generic {indexing_maps = [#[[$MAP]], #[[$MAP1]]], iterator_types = ["parallel",
 // CHECK-SAME:     "parallel", "reduction"]} ins(%[[ARG0]] : tensor<2x16x32xf32>) outs(%[[D2]] : tensor<2x16xf32>) {

MaheshRavishankar

Thanks!

The decomposition of `linalg.softmax` uses `maxnumf`, but the identity element that is used in the generated code is the one for `maximumf`. They are not the same, as the identity for `maxnumf` is `NaN`, while the one of `maximumf` is `-Infty`. This is wrong and prevents the maxnumf from being folded. Related to llvm#114595, which fixed the folder for maxnumf.

Fix neutral elt for softmax

98506b2

oowekyala requested review from dcaballe, nicolasvasilache and rengolin as code owners December 6, 2024 10:32

llvmbot added mlir:linalg mlir labels Dec 6, 2024

MaheshRavishankar approved these changes Dec 6, 2024

View reviewed changes

CoTinker approved these changes Jan 13, 2025

View reviewed changes

CoTinker merged commit 36c3466 into llvm:main Jan 13, 2025
11 checks passed

oowekyala deleted the fix-linalg-softmax-neutral branch January 13, 2025 08:53

nirvedhmeshram mentioned this pull request Jan 20, 2025

Integrate llvm 1_20_2025 iree-org/iree#19740

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[mlir][linalg] Fix neutral elt for softmax #118952

[mlir][linalg] Fix neutral elt for softmax #118952

Uh oh!

oowekyala commented Dec 6, 2024

Uh oh!

llvmbot commented Dec 6, 2024

Uh oh!

llvmbot commented Dec 6, 2024

Uh oh!

MaheshRavishankar left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[mlir][linalg] Fix neutral elt for softmax #118952

[mlir][linalg] Fix neutral elt for softmax #118952

Uh oh!

Conversation

oowekyala commented Dec 6, 2024

Uh oh!

llvmbot commented Dec 6, 2024

Uh oh!

llvmbot commented Dec 6, 2024

Uh oh!

MaheshRavishankar left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants