File tree Expand file tree Collapse file tree 1 file changed +5
-2
lines changed Expand file tree Collapse file tree 1 file changed +5
-2
lines changed Original file line number Diff line number Diff line change 9
9
<img src=https://img.shields.io/badge/License-GPLv3.0-turquoise.svg >
10
10
</div >
11
11
12
- 📒** CUDA-Learn-Notes** : 🎉CUDA&C++ 笔记 / 大模型手撕CUDA / 技术博客,更新随缘: flash_attn、sgemm、sgemv、warp reduce、block reduce、dot、elementwise、softmax、layernorm、rmsnorm、histogram、relu、sigmoid etc.
12
+ 📒** CUDA-Learn-Notes** : 🎉CUDA/C++ 笔记 / 大模型手撕CUDA / 技术博客,更新随缘: flash_attn、sgemm、sgemv、warp reduce、block reduce、dot、elementwise、softmax、layernorm、rmsnorm、histogram、relu、sigmoid etc. 更多资料,请关注本人知乎技术博客: [ DefTruth on ZhiHu] ( https://www.zhihu.com/people/qyjdef/posts )
13
+
13
14
14
15
> 想要我的财宝吗?想要的话可以全部给你,去找吧!我把所有财宝都放在那里!—— ** 哥尔·D·罗杰**
15
16
@@ -19,6 +20,7 @@ Most of my time now is focused on **LLM/VLM** Inference. Please check 📖[Aweso
19
20
20
21
## 0x00 📖 博客目录
21
22
23
+ <!-- -
22
24
### 📒 图解LLM推理优化
23
25
24
26
- vLLM Automatic Prefix Caching: Prefix + Generated KV Caching
@@ -46,8 +48,9 @@ Most of my time now is focused on **LLM/VLM** Inference. Please check 📖[Aweso
46
48
- TensorRT-LLM: TensorRT-LLM部署调优-指北
47
49
48
50

51
+ -->
49
52
50
- 更多资料,请关注本人知乎技术博客: [ DefTruth ] ( https://www.zhihu. com/people/qyjdef/posts )
53
+ < img width = " 1438 " alt = " image " src = " https://github. com/user-attachments/assets/0c5e5125-586f-43fa-8e8b-e2c61c1afbbe " >
51
54
52
55
### 📒 大模型/多模态/SD 推理优化
53
56
You can’t perform that action at this time.
0 commit comments