Commit a1987be
Reland "[NVPTX] Prefer prmt.b32 over bfi.b32" (llvm#114326)
Fix [failure](llvm#110766 (comment)) identified by @akuegel.
---
In [[NVPTX] Improve lowering of v4i8](llvm@cbafb6f), @Artem-B added the
ability to lower ISD::BUILD_VECTOR with bfi PTX instructions. @Artem-B
gave this rationale (llvm#67866 (comment)):

> Under the hood byte extraction/insertion ends up as BFI/BFE
> instructions, so we may as well do that in PTX, too.
> https://godbolt.org/z/Tb3zWbj9b

However, the example that @Artem-B linked was targeting sm_52. On modern
architectures, ptxas uses prmt.b32.
[Example](https://godbolt.org/z/Ye4W1n84o).
Thus, remove uses of NVPTXISD::BFI in favor of NVPTXISD::PRMT.

1 parent 19b4f17 commit a1987be
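
To make the bfi-versus-prmt trade-off concrete, here is a minimal host-side sketch that emulates both PTX instructions and packs four bytes into one 32-bit word each way. It is not the code from this commit. The function names (`bfi_b32`, `prmt_b32`, `build_v4i8_bfi`, `build_v4i8_prmt`) and the selector constants `0x3340` and `0x5410` are illustrative assumptions, and the emulation follows the generic (default-mode) `prmt.b32` and `bfi.b32` semantics as described in the PTX ISA. It assumes C++14 or later for the `constexpr` loops.

```cpp
#include <cassert>   // assert, for runtime checks by callers
#include <cstdint>

// Emulates PTX bfi.b32: insert the low `len` bits of `a` into `b` at bit `pos`.
// Returns `b` unchanged when the field is empty or starts past bit 31.
constexpr uint32_t bfi_b32(uint32_t a, uint32_t b, uint32_t pos, uint32_t len) {
    pos &= 0xFF;
    len &= 0xFF;
    if (len == 0 || pos > 31) return b;
    if (len > 32 - pos) len = 32 - pos;  // clamp the field to the register width
    const uint32_t field = (len == 32) ? 0xFFFFFFFFu : ((1u << len) - 1u);
    const uint32_t mask = field << pos;
    return (b & ~mask) | ((a << pos) & mask);
}

// Emulates generic-mode PTX prmt.b32: each 4-bit nibble of `sel` picks one of
// the eight bytes of {b, a} (bytes 0-3 from `a`, bytes 4-7 from `b`). If bit 3
// of the nibble is set, the byte's sign bit is replicated instead.
constexpr uint32_t prmt_b32(uint32_t a, uint32_t b, uint32_t sel) {
    const uint64_t pool = (static_cast<uint64_t>(b) << 32) | a;
    uint32_t result = 0;
    for (int i = 0; i < 4; ++i) {
        const uint32_t nibble = (sel >> (4 * i)) & 0xF;
        uint32_t byte = static_cast<uint32_t>(pool >> (8 * (nibble & 7))) & 0xFF;
        if (nibble & 8) byte = (byte & 0x80) ? 0xFFu : 0x00u;
        result |= byte << (8 * i);
    }
    return result;
}

// Packs four bytes with a chain of four dependent bit-field inserts.
constexpr uint32_t build_v4i8_bfi(uint32_t e0, uint32_t e1, uint32_t e2, uint32_t e3) {
    uint32_t r = bfi_b32(e0, 0, 0, 8);
    r = bfi_b32(e1, r, 8, 8);
    r = bfi_b32(e2, r, 16, 8);
    return bfi_b32(e3, r, 24, 8);
}

// Packs four bytes with three permutes: two independent pair merges, then a
// final merge of the two half-results.
constexpr uint32_t build_v4i8_prmt(uint32_t e0, uint32_t e1, uint32_t e2, uint32_t e3) {
    const uint32_t lo = prmt_b32(e0, e1, 0x3340);  // bytes: e0, e1, a.b3, a.b3
    const uint32_t hi = prmt_b32(e2, e3, 0x3340);  // bytes: e2, e3, a.b3, a.b3
    return prmt_b32(lo, hi, 0x5410);               // bytes: e0, e1, e2, e3
}

// Both strategies produce the same packed word.
static_assert(build_v4i8_bfi(0x11, 0x22, 0x33, 0x44) == 0x44332211u, "bfi packing");
static_assert(build_v4i8_prmt(0x11, 0x22, 0x33, 0x44) == 0x44332211u, "prmt packing");
```

In this sketch the permute-based build uses three operations, two of which are independent, where the insert-based build uses a chain of four dependent ones. Whether that helps on a given GPU depends on what ptxas emits for the target architecture.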
File tree: 3 files changed, +339 -334 lines changed
- llvm
  - lib/Target/NVPTX
  - test/CodeGen/NVPTX