Commit ae09723
committed
[AArch64] Add FP8 Neon intrinsics for dot-product
THis patch adds the following intrinsics:
float16x4_t vdot_f16_mf8_fpm(float16x4_t vd, mfloat8x8_t vn, mfloat8x8_t vm, fpm_t fpm)
float16x8_t vdotq_f16_mf8_fpm(float16x8_t vd, mfloat8x16_t vn, mfloat8x16_t vm, fpm_t fpm)
float16x4_t vdot_lane_f16_mf8_fpm(float16x4_t vd, mfloat8x8_t vn, mfloat8x8_t vm, __builtin_constant_p(lane), fpm_t fpm)
float16x4_t vdot_laneq_f16_mf8_fpm(float16x4_t vd, mfloat8x8_t vn, mfloat8x16_t vm, __builtin_constant_p(lane), fpm_t fpm)
float16x8_t vdotq_lane_f16_mf8_fpm(float16x8_t vd, mfloat8x16_t vn, mfloat8x8_t vm, __builtin_constant_p(lane), fpm_t fpm)
float16x8_t vdotq_laneq_f16_mf8_fpm(float16x8_t vd, mfloat8x16_t vn, mfloat8x16_t vm, __builtin_constant_p(lane), fpm_t fpm)
[fixup] Remove not needed argument (NFC)
[fixup] Update intrinsics declarations
[fixup] Add C++ runs to tests, remove some opt passes1 parent 99bd2e3 commit ae09723
File tree
10 files changed
+529
-40
lines changed- clang
- include/clang/Basic
- lib/CodeGen
- test
- CodeGen/AArch64/fp8-intrinsics
- Sema/aarch64-fp8-intrinsics
- llvm
- include/llvm/IR
- lib/Target/AArch64
- test/CodeGen/AArch64
10 files changed
+529
-40
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
2141 | 2141 | | |
2142 | 2142 | | |
2143 | 2143 | | |
| 2144 | + | |
| 2145 | + | |
| 2146 | + | |
| 2147 | + | |
| 2148 | + | |
| 2149 | + | |
| 2150 | + | |
| 2151 | + | |
| 2152 | + | |
| 2153 | + | |
| 2154 | + | |
| 2155 | + | |
| 2156 | + | |
| 2157 | + | |
| 2158 | + | |
| 2159 | + | |
| 2160 | + | |
| 2161 | + | |
| 2162 | + | |
| 2163 | + | |
2144 | 2164 | | |
2145 | 2165 | | |
2146 | 2166 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
302 | 302 | | |
303 | 303 | | |
304 | 304 | | |
305 | | - | |
| 305 | + | |
306 | 306 | | |
307 | 307 | | |
308 | 308 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
6766 | 6766 | | |
6767 | 6767 | | |
6768 | 6768 | | |
| 6769 | + | |
| 6770 | + | |
| 6771 | + | |
| 6772 | + | |
| 6773 | + | |
| 6774 | + | |
| 6775 | + | |
| 6776 | + | |
| 6777 | + | |
| 6778 | + | |
| 6779 | + | |
| 6780 | + | |
| 6781 | + | |
| 6782 | + | |
| 6783 | + | |
| 6784 | + | |
| 6785 | + | |
| 6786 | + | |
6769 | 6787 | | |
6770 | 6788 | | |
6771 | 6789 | | |
| |||
12761 | 12779 | | |
12762 | 12780 | | |
12763 | 12781 | | |
| 12782 | + | |
12764 | 12783 | | |
12765 | 12784 | | |
12766 | 12785 | | |
| |||
14028 | 14047 | | |
14029 | 14048 | | |
14030 | 14049 | | |
| 14050 | + | |
| 14051 | + | |
| 14052 | + | |
| 14053 | + | |
| 14054 | + | |
| 14055 | + | |
| 14056 | + | |
| 14057 | + | |
| 14058 | + | |
| 14059 | + | |
| 14060 | + | |
| 14061 | + | |
| 14062 | + | |
| 14063 | + | |
| 14064 | + | |
| 14065 | + | |
| 14066 | + | |
| 14067 | + | |
| 14068 | + | |
| 14069 | + | |
| 14070 | + | |
| 14071 | + | |
| 14072 | + | |
| 14073 | + | |
| 14074 | + | |
14031 | 14075 | | |
14032 | 14076 | | |
14033 | 14077 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
4699 | 4699 | | |
4700 | 4700 | | |
4701 | 4701 | | |
| 4702 | + | |
| 4703 | + | |
| 4704 | + | |
| 4705 | + | |
4702 | 4706 | | |
4703 | 4707 | | |
4704 | 4708 | | |
| |||
0 commit comments