-
Couldn't load subscription status.
- Fork 15k
[AMDGPU] expand-fp: always report modifications #163153
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[AMDGPU] expand-fp: always report modifications #163153
Conversation
The last change to the pass lost the assignment to the "Modified" variable for one of the pass optimizations. Add it back.
|
This fixes the test failure in |
| continue; | ||
|
|
||
| addToWorklist(I, Worklist); | ||
| Modified = true; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
How about removing all the assignments on add to worklist, and do it once when processing the worklist
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is possible provided scalarize always pushes to the Worklist, i.e. the instructions created in this function cannot be folded. This seems to hold true for the instructions handled in this pass. I have implemented the change accordingly in this commit. Does that make sense or should I perhaps pass NoFolder to the IRBuilder ?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'll revert the additional changes for now to fix the build bot failures. We can still add them in a follow-up PR.
This reverts commit 0ea427f.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks - fixes my local checks
The last change to the pass in PR llvm#158588 lost the assignment to the "Modified" variable for one of the pass optimizations. Add it back. This fixes the test failure in `CodeGen/AMDGPU/itofp.i128.bf.ll` (in a `LLVM_ENABLE_EXPENSIVE_CHECKS=ON` build).
The last change to the pass in PR #158588 lost the assignment to the "Modified" variable for one of the pass optimizations.
Add it back. This fixes the test failure in
CodeGen/AMDGPU/itofp.i128.bf.ll(in aLLVM_ENABLE_EXPENSIVE_CHECKS=ONbuild).