Commit 9c9e621
committed
Improve performance of vector broadcast in SVE2
Modified codegen of vector broadcast in SVE2 to emit
TBL ARM intrin instead of llvm.vector.insert.
Fix performance test failure of nested_vectorization_gemm1 parent a7bc84b commit 9c9e621
1 file changed
+0
-13
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
2067 | 2067 | | |
2068 | 2068 | | |
2069 | 2069 | | |
2070 | | - | |
2071 | | - | |
2072 | | - | |
2073 | | - | |
2074 | | - | |
2075 | | - | |
2076 | | - | |
2077 | | - | |
2078 | | - | |
2079 | | - | |
2080 | | - | |
2081 | | - | |
2082 | | - | |
2083 | 2070 | | |
2084 | 2071 | | |
2085 | 2072 | | |
| |||
0 commit comments