Commit 99fb0fd
[AArch64][LoopVectorize] Enable tail-folding on neoverse-v2
This patch enables tail-folding of simple loops by default when
targeting the neoverse-v2 CPU. This was done for neoverse-v1 in
c7dbe32.
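
For readers unfamiliar with the term, the following is a rough conceptual sketch of what tail-folding means for a simple loop. It is not code from this patch and not what the vectorizer emits. The fixed width `VL`, the function names, and the boolean-array predicate are all assumptions made for illustration; on SVE hardware the predicate would come from an instruction such as `whilelo` and the vector length would not be a compile-time constant.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical fixed vector width, for illustration only.
constexpr std::size_t VL = 4;

// Conventional shape: a full-width vector body plus a scalar remainder loop.
void add_with_scalar_tail(const std::vector<int>& a, const std::vector<int>& b,
                          std::vector<int>& out) {
    assert(a.size() == b.size() && a.size() == out.size());
    const std::size_t n = a.size();
    std::size_t i = 0;
    for (; i + VL <= n; i += VL) {
        // The inner loop stands in for one unpredicated vector operation.
        for (std::size_t lane = 0; lane < VL; ++lane)
            out[i + lane] = a[i + lane] + b[i + lane];
    }
    for (; i < n; ++i)  // scalar epilogue handles the n % VL leftover elements
        out[i] = a[i] + b[i];
}

// Tail-folded shape: a single loop where a per-lane predicate disables the
// lanes that would run past the end, so no separate remainder loop is needed.
void add_tail_folded(const std::vector<int>& a, const std::vector<int>& b,
                     std::vector<int>& out) {
    assert(a.size() == b.size() && a.size() == out.size());
    const std::size_t n = a.size();
    for (std::size_t i = 0; i < n; i += VL) {
        std::array<bool, VL> pred{};
        for (std::size_t lane = 0; lane < VL; ++lane)
            pred[lane] = (i + lane) < n;  // active only while in bounds
        for (std::size_t lane = 0; lane < VL; ++lane)
            if (pred[lane])
                out[i + lane] = a[i + lane] + b[i + lane];
    }
}
```

Both functions compute the same result; the second simply folds the remainder iterations into the final, partially active pass of the main loop.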
For SPEC2017 with "-Ofast -mcpu=neoverse-v2 -flto" this gives some small
wins:
549.fotonik3d_r: ~3.2%
525.x264_r: ~2.7%
554.roms_r: ~1.2%
1 parent 4cde945 commit 99fb0fd
File tree
2 files changed: +4 -0 lines changed
- llvm
  - lib/Target/AArch64
  - test/Transforms/LoopVectorize/AArch64
Lines changed: 2 additions & 0 deletions
Two lines added at diff lines 271-272, between original lines 270 and 271.
Lines changed: 2 additions & 0 deletions
Two lines added at diff lines 14-15, between original lines 13 and 14.