[SelectionDAGBuilder] Propagate fast-math flags to fpext #167574

mikolaj-pirog · 2025-11-11T20:38:32Z

As in title. Without this, fpext behaves in selectionDAG as always having no fast-math flags.

llvmbot · 2025-11-11T20:39:07Z

@llvm/pr-subscribers-backend-powerpc

@llvm/pr-subscribers-llvm-selectiondag

Author: Mikołaj Piróg (mikolaj-pirog)

Changes

As in title. Without this, fpext behaves in selectionDAG as always having no fmf flags.

Full diff: https://github.com/llvm/llvm-project/pull/167574.diff

1 Files Affected:

(modified) llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp (+4-1)

diff --git a/llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp b/llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
index 9baf72b266aa7..16f555b16a621 100644
--- a/llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
+++ b/llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
@@ -3976,7 +3976,10 @@ void SelectionDAGBuilder::visitFPExt(const User &I) {
   SDValue N = getValue(I.getOperand(0));
   EVT DestVT = DAG.getTargetLoweringInfo().getValueType(DAG.getDataLayout(),
                                                         I.getType());
-  setValue(&I, DAG.getNode(ISD::FP_EXTEND, getCurSDLoc(), DestVT, N));
+  SDNodeFlags Flags;
+  if (auto *TruncInst = dyn_cast<FPMathOperator>(&I))
+    Flags.copyFMF(*TruncInst);
+  setValue(&I, DAG.getNode(ISD::FP_EXTEND, getCurSDLoc(), DestVT, N, Flags));
 }
 
 void SelectionDAGBuilder::visitFPToUI(const User &I) {

mikolaj-pirog · 2025-11-11T20:47:20Z

I'm seeing no testing failures on this one, meaning no code path checks in meaningful way fmf on fpext nodes. I run into this issue while working on strengthening fmf semantics

arsenm

Should be testable? fpext flags are new, I'd assume that would have covered this?

mikolaj-pirog · 2025-11-11T22:54:37Z

Should be testable? fpext flags are new, I'd assume that would have covered this?

I don't think we have explicit testing for this stuff (i.e. if flags propagate correctly on DAG construction), I couldn't find an easy to test it either. I believe other flags get naturally tested if they work with tests for respective transformations. Would writing a unittest for it be an overkill?

I've filled a PR that requires this fixes, so that could be repurposed as a test for it

arsenm · 2025-11-12T05:26:27Z

Should be testable? fpext flags are new, I'd assume that would have covered this?

I don't think we have explicit testing for this stuff (i.e. if flags propagate correctly on DAG construction), I couldn't find an easy to test it either. I believe other flags get naturally tested if they work with tests for respective transformations. Would writing a unittest for it be an overkill?

The brute force way would be use -stop-after=finalize-isel and check the MIR, but it would be better to have a transform dependent on the flags

I've filled a PR that requires this fixes, so that could be repurposed as a test for it

That one's big and not obviously related to fpext, I'd rather test it here. Is there some other context the flag will do something?

mikolaj-pirog · 2025-11-12T10:15:19Z

Should be testable? fpext flags are new, I'd assume that would have covered this?

I don't think we have explicit testing for this stuff (i.e. if flags propagate correctly on DAG construction), I couldn't find an easy to test it either. I believe other flags get naturally tested if they work with tests for respective transformations. Would writing a unittest for it be an overkill?

The brute force way would be use -stop-after=finalize-isel and check the MIR, but it would be better to have a transform dependent on the flags

I've filled a PR that requires this fixes, so that could be repurposed as a test for it

That one's big and not obviously related to fpext, I'd rather test it here. Is there some other context the flag will do something?

Some combines peek through FP_EXTEND to perform fmf rewrite, but none checks if FP_EXTEND has any flags. I could add this check to one of the combines and update the tests. DAGCombiner::isContractableFMUL is one of these combines.

github-actions · 2025-11-13T10:55:24Z

✅ With the latest revision this PR passed the C/C++ code formatter.

mikolaj-pirog · 2025-11-13T13:41:51Z

Should be testable? fpext flags are new, I'd assume that would have covered this?

I don't think we have explicit testing for this stuff (i.e. if flags propagate correctly on DAG construction), I couldn't find an easy to test it either. I believe other flags get naturally tested if they work with tests for respective transformations. Would writing a unittest for it be an overkill?

The brute force way would be use -stop-after=finalize-isel and check the MIR, but it would be better to have a transform dependent on the flags

I've filled a PR that requires this fixes, so that could be repurposed as a test for it

That one's big and not obviously related to fpext, I'd rather test it here. Is there some other context the flag will do something?

The FPEXT in fdiv from other PR is a good candidate for testing this -- I've included checking arcp for FPEXT node here. One test caught the difference -- would that suffice for testing?

bjope · 2025-11-16T17:17:00Z

llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp

                                                        I.getType());
-  setValue(&I, DAG.getNode(ISD::FP_EXTEND, getCurSDLoc(), DestVT, N));
+  SDNodeFlags Flags;
+  if (auto *TruncInst = dyn_cast<FPMathOperator>(&I))


TruncInst is confusing here (I guess it has been copied from somewhere). You could perhaps all it Operator or FPExtInst or something, but it seems like FPOp is a common variable name when dyn_cast-ing to a FPMathOperator. So I would suggest FPOp.

You are right, I will follow-up with a correct name

s-watanabe314 · 2025-11-18T01:35:22Z

It seems our tests started failing after this patch. I'm not familiar with IR or DAG, so I'm not sure if this patch is the cause, if it's an optimization issue, or if this is the correct behavior. Any comments would be greatly appreciated.

Here's the C program in question:
https://godbolt.org/z/jqM5foWEj

#include <stdio.h>
#include <complex.h>

_Complex double XD = (3.123457687 + 0i);
_Complex double YD = (6.123457687 + 0i);

int main() {
  _Complex float cf;
  double real;

  real = creal(XD + YD);
  // Expect double-precision output
  printf("%40.35lf\n", real);
  
  cf = XD + YD;
  // Expect single-precision output
  printf("%40.35f\n",crealf(cf));
  
  return 0;
}

In this program, we expect a single-precision output on line 17, but when -ffast-math is specified, it outputs a double-precision value. Looking at the IR, fptrunc and fpext have fast flags, and it seems that the conversion from double-precision to single-precision is removed because of this.

Is this the intended behavior? Is such an optimization with the fast-math flag allowed?

mikolaj-pirog · 2025-11-18T18:26:13Z

From quick look at the example it appears to be okay. In C vararg functions like printf promote float to doubles. The reason it was working differently before is probably this change -- now fpext has correctly set fast-math while previously it always had none. The effect of this was that if fpext was being rewritten with some other nodes, to a new node, this new node had an intersection of flags, but since fpext had no flags, this new node had none as well -- meaning some rewrites couldn't be made on this new node because it had no flags -- and now they can be made.

So in other words: promoting floats to double in this case is okay because of how printf works. The change in value is probably because this patch allowed (or in better words -- stopped blocking) some rewrites from kicking in. Keep in mind this patch doesn't provide new features, it merely fixes what was broken before (i.e. fpext, internally, never had any flags set).

s-watanabe314 · 2025-11-19T01:14:39Z

Thanks for the reply. I understand that this patch itself is correct.

However, since the value to be printed is the return value of crealf, I believe that values exceeding the float range should be truncated. In other words, I'm concerned about whether it's okay for the return value of crealf to be double. This might be related to the C language standard. Additionally, the documentation for the fast flag in IR does not mention optimizations that would offset type conversions, so I'm unsure if this kind of optimization is permissible. Even if there are no problems with this patch, is it possible that other optimizations could introduce issues?

arsenm · 2025-11-19T01:25:09Z

Additionally, the documentation for the fast flag in IR does not mention optimizations that would offset type conversions, so I'm unsure if this kind of optimization is permissible.

Basically anything and everything is permissible with fast math flags. It's explicitly opting out of any standard to do something faster. It's not going to micro-disallow specific conversions or anything like that

s-watanabe314 · 2025-11-19T06:30:45Z

From Clang's perspective, this optimization appears permissible because -ffast-math, which enables aggressive optimizations, is specified. From an IR perspective, it also seems acceptable, as this optimization is permitted by the semantics of the fast flags. Is this understanding correct? My interpretation at the IR level is that the contract flags (which is implied by 'fast') led to the removal of fptrunc and fpext in this case.

mikolaj-pirog · 2025-11-19T22:21:13Z

From Clang's perspective, this optimization appears permissible because -ffast-math, which enables aggressive optimizations, is specified. From an IR perspective, it also seems acceptable, as this optimization is permitted by the semantics of the fast flags. Is this understanding correct? My interpretation at the IR level is that the contract flags (which is implied by 'fast') led to the removal of fptrunc and fpext in this case.

Yes, clang sees fast enabled from CLI and puts fast on IR instructions, backend sees fast flag on IR and does various fast-math rewrites¹. It looks like the contract is the flag responsible, but I haven't tested it.

There are also global flags that dictate fast-math behavior, i.e. backend may consult global flag instead of IR flag. There's active work being done to make backend rely only on instruction-level flags ↩

Propagate fast-math flags through fpext

2cc7c09

mikolaj-pirog requested review from arsenm and paperchalice November 11, 2025 20:38

llvmbot added the llvm:SelectionDAG SelectionDAGISel as well label Nov 11, 2025

arsenm reviewed Nov 11, 2025

View reviewed changes

mikolaj-pirog mentioned this pull request Nov 11, 2025

[DAGCombiner] Honor rewrite semantics of fast-math flags in fdiv combine #167595

Open

Check for arcp on fpext on fdiv combine

bf228bb

llvmbot added the backend:PowerPC label Nov 13, 2025

Formatting

31250ab

mikolaj-pirog requested a review from arsenm November 13, 2025 13:41

arsenm approved these changes Nov 15, 2025

View reviewed changes

arsenm merged commit e7b41df into llvm:main Nov 15, 2025
10 checks passed

bjope reviewed Nov 16, 2025

View reviewed changes

[SelectionDAGBuilder] Propagate fast-math flags to fpext #167574

[SelectionDAGBuilder] Propagate fast-math flags to fpext #167574

Uh oh!

Conversation

mikolaj-pirog commented Nov 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

llvmbot commented Nov 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mikolaj-pirog commented Nov 11, 2025

Uh oh!

arsenm left a comment

Choose a reason for hiding this comment

Uh oh!

mikolaj-pirog commented Nov 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

arsenm commented Nov 12, 2025

Uh oh!

mikolaj-pirog commented Nov 12, 2025

Uh oh!

github-actions bot commented Nov 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mikolaj-pirog commented Nov 13, 2025

Uh oh!

Uh oh!

bjope Nov 16, 2025

Choose a reason for hiding this comment

Uh oh!

mikolaj-pirog Nov 17, 2025

Choose a reason for hiding this comment

Uh oh!

s-watanabe314 commented Nov 18, 2025

Uh oh!

mikolaj-pirog commented Nov 18, 2025

Uh oh!

s-watanabe314 commented Nov 19, 2025

Uh oh!

arsenm commented Nov 19, 2025

Uh oh!

s-watanabe314 commented Nov 19, 2025

Uh oh!

mikolaj-pirog commented Nov 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Footnotes

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

mikolaj-pirog commented Nov 11, 2025 •

edited

Loading

llvmbot commented Nov 11, 2025 •

edited

Loading

mikolaj-pirog commented Nov 11, 2025 •

edited

Loading

github-actions bot commented Nov 13, 2025 •

edited

Loading

mikolaj-pirog commented Nov 19, 2025 •

edited

Loading