1 file changed, +8 -4 lines changed
13
13
14
14
<img width="1438" alt="image" src="https://github.com/user-attachments/assets/0c5e5125-586f-43fa-8e8b-e2c61c1afbbe">
15
15
16
- ## 🎉🎉 HGEMM/SGEMM Supported Matrix
16
+ ----
17
+ <h3 align="center">📖 HGEMM/SGEMM Supported Matrix</h3>
17
18
18
19
| CUDA Cores| Sliced K(Loop over K)| Tile Block| Tile Thread|
19
20
| :---:| :---:| :---:| :---:|
27
28
| **Row Major(NN)** | **Col Major(TN)** | **SGEMM TF32** | **SMEM Swizzle** |
28
29
| ✔️| ✔️| ✔️| ❔|
29
30
30
- 🎉 Welcome to 🌟👆🏻star & submit a PR to this repo, as it is the simplest way to support me.
31
31
32
- ## 0x00 📖 CUDA Kernel Index (Common Interview Questions)
32
+ <p align="center">🎉 Welcome to 🌟👆🏻star & submit a PR to this repo, as it is the simplest way to support me.</p>
33
+
34
+ ----
35
+
36
+ ## 📖 CUDA Kernel Index (Common Interview Questions)
33
37
- / = not supported now.
34
38
- ✔️ = known to work and already supported.
35
39
- ❔ = planned, but not coming soon; maybe in a few weeks.
180
184
181
185
👉TIPS: * means the kernel uses **Tensor Cores (MMA/WMMA)**; otherwise it uses CUDA Cores by default.
182
186
183
- ## 0x01 📖 Blog Index
187
+ ## 📖 Blog Index
184
188
185
189
### 📖 LLM|Multimodal|Diffusion|Inference Optimization (written by the author)
186
190