Commit a24fae3
[AMDGPU][True16][GlobalISel] Fix v2*16 build_vector patterns (#151496)
- A pattern with IMPLICIT_DEF failed to generate an entry in the MatchTable
and reported no error; it failed silently. This is fixed by casting
IMPLICIT_DEF to the appropriate type. This also fixes selecting
"build_vector s16, undef" for GlobalISel with True16.
- Add a pattern for "build_vector undef, s16" that works for GlobalISel.
True16 GlobalISel has a G_TRUNC that it needs to deal with.
- Use REG_SEQUENCE for Real16 patterns instead of V_LSHLREV_B32_e64 to
generate more optimal code.

1 parent 8cc4c6d commit a24fae3
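
To make the last bullet concrete, here is a minimal Python model of the value a v2*16 build_vector produces. It is only an illustration, not code from this commit. It assumes the usual layout in which element 0 occupies bits 15:0 and element 1 occupies bits 31:16 of the 32-bit register. The function names are hypothetical. The sketch contrasts placing a 16-bit value in the high half with a 32-bit left shift (the effect of a shift such as V_LSHLREV_B32_e64 by 16) against composing the register from two 16-bit halves (the effect REG_SEQUENCE has on 16-bit subregisters).

```python
MASK16 = 0xFFFF
MASK32 = 0xFFFFFFFF


def build_vector_v2x16(lo, hi):
    """Pack two 16-bit elements into one 32-bit value.

    Assumed layout: element 0 (lo) in bits 15:0, element 1 (hi) in bits 31:16.
    """
    return ((hi & MASK16) << 16) | (lo & MASK16)


def high_half_via_shift(value):
    """Model 'build_vector undef, value' done with a 32-bit shift left by 16.

    The low half ends up as zero, which is one legal choice for undef.
    """
    return ((value & MASK16) << 16) & MASK32


def high_half_via_halves(value, undef_lo=0):
    """Model the same build_vector by composing two 16-bit halves.

    undef_lo stands in for whatever happens to be in the undefined low half;
    any 16-bit value is acceptable there, so no shift is required.
    """
    return build_vector_v2x16(undef_lo, value)


if __name__ == "__main__":
    print(hex(build_vector_v2x16(0x1234, 0xABCD)))
    print(hex(high_half_via_shift(0xABCD)))
    print(hex(high_half_via_halves(0xABCD, undef_lo=0x5555)))
```

Both approaches agree on the defined (high) half. They differ only in what lands in the undefined half, which is why a subregister composition can stand in for the shift without an extra ALU instruction.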
File tree
10 files changed, +675 -176 lines changed
- llvm
  - lib/Target/AMDGPU
  - test/CodeGen/AMDGPU