A test PR for #140694 while waiting for #149110 to complete #149824

chrisjbris · 2025-07-21T14:25:07Z

github-actions · 2025-07-21T14:34:42Z

✅ With the latest revision this PR passed the C/C++ code formatter.

chrisjbris · 2025-07-21T15:57:25Z

Rebased on to main to clear regression of ptradd-sdag-optimizations.ll.

Add to the VOP patterns to recognise when or/xor/and are modifying only the sign bit and replace with the appropriate srcmod.

64-bit wide instructions Make use of s_or_b64/s_and_b64/s_xor_b64 for v2i32. Legalising these causes a number of test regressions, so extra work in the combiner and Tablegen patterns was necessary. - Use custom for v2i32 rotr instead of additional patterns. Modify PerformOrCombine() to remove some identity or operations - Fix rotr regression by adding lowerRotr() on the legalizer codepath. - Add test case to rotr.ll - Extend performFNEGCombine() for the SELECT case. - Modify performSelectCombine() and foldFreeOpFromSelect to prevent the performFNEGCombine() changes from being unwound. - Add cases to or.ll and xor.ll to demonstrate the generation of the s_or_64 and s_xor_64 instructions for the v2i32 cases. Previously this was inhibited by "-amdgpu-scalarize-global-loads=false". - Fix shl/srl64_reduce regression by performing the scalarisation previously performewd by the vector legaliser in the combiner.

… line.

…aken place for tens of other tests.

This prevents any regressions in feng-modifier-casting.ll.

is made legal for or/xor/and. Complete fix of v2i32 in VOP SrcMod placement.

Factor shift reducing combine logic into one function as it was applied in all three shift combine functions.

chrisjbris self-assigned this Jul 21, 2025

chrisjbris changed the title ~~A test PR for #140694 while waiting for #149110 to be accepted~~ A test PR for #140694 while waiting for #149110 to complete Jul 21, 2025

chrisjbris force-pushed the 124775_AMDGPU_v2i32_bitwise_ops_VOP_rebase branch from 6ad3626 to b653568 Compare July 21, 2025 15:56

chrisjbris force-pushed the 124775_AMDGPU_v2i32_bitwise_ops_VOP_rebase branch 2 times, most recently from 0c3ddc5 to 145264a Compare July 21, 2025 16:21

chrisjbris added 23 commits July 22, 2025 05:27

[AMDGPU] Recognise bitmask operations as srcmods

c6c2bb7

Add to the VOP patterns to recognise when or/xor/and are modifying only the sign bit and replace with the appropriate srcmod.

Remove over-enthusiastic clang-format

d51a901

Respond to some review comments

61fa9f7

Add reviewer requested tests

675a024

Suppress over-enthusiastic clang-format

c0092d0

Temporarily remove r600 from or.ll test

f58cac5

Add SGPR and VGPR tests to and.ll and temporarily remove the r600 run…

7ce16e8

… line.

Remove dead check-lines from or.ll

109e482

Apply reviewer comments to performFNegCombine

e0f517f

Remove dead code

ede00b0

Re-enstate r600 tests in independent files. This action has already t…

c5904d8

…aken place for tens of other tests.

Remove unhelpful commentary.

c5b767c

Remove unnecessary driveby clang-format

3da5a3d

Remove dead checks in xor.ll

8c91bff

Remove unnnecessary node duplication

d60d011

Modify allUsesHaveSourceMods() instead of foldFreeOpFromSelect()

1d3f754

This prevents any regressions in feng-modifier-casting.ll.

Remove single-use variables from buildVectorSupportsSourceMods()

0ada80b

Correct failure to call getOpcode()

5a97e1c

Work to fix regressions in integer select srcmod generation when v2i32

573adfe

is made legal for or/xor/and. Complete fix of v2i32 in VOP SrcMod placement.

Fix 64-bit ashr scalarisation of and for fold int 32-bit shift

46786e7

Factor shift reducing combine logic into one function as it was applied in all three shift combine functions.

Tidy up getShiftForReduction()

7a6fe79

Remove driveby formatting fixes

09d745a

Fix formatting of xorcombine - how did this regress?

d789ece

chrisjbris force-pushed the 124775_AMDGPU_v2i32_bitwise_ops_VOP_rebase branch from 697f3cb to d789ece Compare July 22, 2025 10:28

Simpify SelectVOP3ModsImpl

cc8652e

chrisjbris closed this Aug 6, 2025

chrisjbris mentioned this pull request Aug 7, 2025

[AMDGPU] Recognise bitmask operations as srcmods on select #152119

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

A test PR for #140694 while waiting for #149110 to complete #149824

A test PR for #140694 while waiting for #149110 to complete #149824

Uh oh!

chrisjbris commented Jul 21, 2025

Uh oh!

github-actions bot commented Jul 21, 2025 •

edited

Loading

Uh oh!

chrisjbris commented Jul 21, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

A test PR for #140694 while waiting for #149110 to complete #149824

A test PR for #140694 while waiting for #149110 to complete #149824

Uh oh!

Conversation

chrisjbris commented Jul 21, 2025

Uh oh!

github-actions bot commented Jul 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

chrisjbris commented Jul 21, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

github-actions bot commented Jul 21, 2025 •

edited

Loading