@@ -14,7 +14,7 @@ Legend:
14
14
15
15
| Operation | BLAS | CPU | CUDA | Metal | SYCL | Vulkan |
16
16
| -----------| ------| ------| ------| ------| ------| ------|
17
- | ABS | ❌ | ✅ | 🟡 | ❌ | 🟡 | ❌ |
17
+ | ABS | ❌ | ✅ | 🟡 | 🟡 | 🟡 | ❌ |
18
18
| ACC | ❌ | ✅ | ✅ | ✅ | ✅ | ✅ |
19
19
| ADD | ❌ | ✅ | ✅ | 🟡 | ✅ | ✅ |
20
20
| ADD1 | ❌ | ✅ | ✅ | ❌ | ✅ | ❌ |
@@ -37,7 +37,7 @@ Legend:
37
37
| DIV | ❌ | ✅ | ✅ | 🟡 | ✅ | ✅ |
38
38
| DUP | ❌ | ✅ | 🟡 | 🟡 | ✅ | 🟡 |
39
39
| ELU | ❌ | ✅ | 🟡 | 🟡 | 🟡 | ❌ |
40
- | EXP | ❌ | ✅ | 🟡 | ❌ | 🟡 | ❌ |
40
+ | EXP | ❌ | ✅ | 🟡 | 🟡 | 🟡 | ❌ |
41
41
| FLASH_ATTN_EXT | ❌ | ✅ | 🟡 | 🟡 | ❌ | 🟡 |
42
42
| GATED_LINEAR_ATTN | ❌ | ✅ | ✅ | ❌ | ✅ | ❌ |
43
43
| GEGLU | ❌ | ✅ | ✅ | 🟡 | ✅ | 🟡 |
@@ -49,8 +49,8 @@ Legend:
49
49
| GET_ROWS | ❌ | ✅ | 🟡 | ✅ | 🟡 | 🟡 |
50
50
| GET_ROWS_BACK | ❌ | 🟡 | 🟡 | ❌ | ❌ | ❌ |
51
51
| GROUP_NORM | ❌ | ✅ | ✅ | ✅ | ✅ | ✅ |
52
- | HARDSIGMOID | ❌ | ✅ | 🟡 | ❌ | 🟡 | ❌ |
53
- | HARDSWISH | ❌ | ✅ | 🟡 | ❌ | 🟡 | ❌ |
52
+ | HARDSIGMOID | ❌ | ✅ | 🟡 | 🟡 | 🟡 | ❌ |
53
+ | HARDSWISH | ❌ | ✅ | 🟡 | 🟡 | 🟡 | ❌ |
54
54
| IM2COL | ❌ | ✅ | ✅ | 🟡 | ✅ | ✅ |
55
55
| L2_NORM | ❌ | ✅ | ✅ | ✅ | ✅ | ✅ |
56
56
| LEAKY_RELU | ❌ | ✅ | ✅ | ✅ | ✅ | ✅ |
@@ -72,8 +72,8 @@ Legend:
72
72
| REPEAT_BACK | ❌ | ✅ | ✅ | ❌ | ❌ | ✅ |
73
73
| RMS_NORM | ❌ | ✅ | ✅ | 🟡 | ✅ | ✅ |
74
74
| RMS_NORM_BACK | ❌ | ✅ | ✅ | ❌ | ❌ | ✅ |
75
- | RMS_NORM_MUL | ❌ | ❌ | ❌ | ✅ | ❌ | ❌ |
76
- | RMS_NORM_MUL_ADD | ❌ | ✅ | ✅ | ❌ | ✅ | ✅ |
75
+ | RMS_NORM_MUL | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ |
76
+ | RMS_NORM_MUL_ADD | ❌ | ✅ | ✅ | ✅ | ✅ | ✅ |
77
77
| ROLL | ❌ | ✅ | ❌ | ❌ | ❌ | ✅ |
78
78
| ROPE | ❌ | ✅ | ✅ | ✅ | ✅ | ✅ |
79
79
| ROPE_BACK | ❌ | ✅ | ✅ | ❌ | ❌ | ✅ |
@@ -82,7 +82,7 @@ Legend:
82
82
| SCALE | ❌ | ✅ | ✅ | ✅ | ✅ | ✅ |
83
83
| SET | ❌ | ✅ | ❌ | ✅ | ❌ | ❌ |
84
84
| SET_ROWS | ❌ | 🟡 | 🟡 | 🟡 | 🟡 | 🟡 |
85
- | SGN | ❌ | ✅ | 🟡 | ❌ | 🟡 | ❌ |
85
+ | SGN | ❌ | ✅ | 🟡 | 🟡 | 🟡 | ❌ |
86
86
| SIGMOID | ❌ | ✅ | 🟡 | 🟡 | 🟡 | 🟡 |
87
87
| SILU | ❌ | ✅ | 🟡 | 🟡 | 🟡 | 🟡 |
88
88
| SILU_BACK | ❌ | ✅ | ✅ | ❌ | ❌ | ✅ |
@@ -93,7 +93,7 @@ Legend:
93
93
| SQRT | ❌ | ✅ | ✅ | 🟡 | ✅ | ❌ |
94
94
| SSM_CONV | ❌ | ✅ | ✅ | ✅ | ❌ | ❌ |
95
95
| SSM_SCAN | ❌ | ✅ | ✅ | ✅ | ❌ | ❌ |
96
- | STEP | ❌ | ✅ | 🟡 | ❌ | 🟡 | ❌ |
96
+ | STEP | ❌ | ✅ | 🟡 | 🟡 | 🟡 | ❌ |
97
97
| SUB | ❌ | ✅ | ✅ | 🟡 | ✅ | ✅ |
98
98
| SUM | ❌ | ✅ | ✅ | ❌ | ✅ | ✅ |
99
99
| SUM_ROWS | ❌ | ✅ | ✅ | ✅ | ✅ | ✅ |
0 commit comments