Commit a854d6c

🔥[Tensor Product] Acceleration of Tensor-Product Operations with Tensor Cores (#90)
1 parent 567f07c commit a854d6c

1 file changed: +2 -1 lines changed

README.md

Lines changed: 2 additions & 1 deletion
@@ -403,7 +403,7 @@ Awesome-LLM-Inference: A curated list of [📙Awesome LLM Inference Papers with
 |2024.08|🔥🔥[Kraken] Kraken: Inherently Parallel Transformers For Efficient Multi-Device Inference(@Princeton) | [[pdf]](https://arxiv.org/pdf/2408.07802)|⚠️|⭐️ |
 |2024.08|🔥🔥[**FLA**] FLA: A Triton-Based Library for Hardware-Efficient Implementations of Linear Attention Mechanism(@sustcsonglin)| [[docs]](https://github.com/sustcsonglin/flash-linear-attention) |[[flash-linear-attention]](https://github.com/sustcsonglin/flash-linear-attention) ![](https://img.shields.io/github/stars/sustcsonglin/flash-linear-attention.svg?style=social)|⭐️⭐️ |
 
-### 📖GEMM/Tensor Cores/WMMA/Parallel ([©️back👆🏻](#paperlist))
+### 📖GEMM/Tensor Cores/MMA/Parallel ([©️back👆🏻](#paperlist))
 <div id="GEMM-Tensor-Cores-WMMA"></div>
 
 |Date|Title|Paper|Code|Recom|
@@ -423,6 +423,7 @@ Awesome-LLM-Inference: A curated list of [📙Awesome LLM Inference Papers with
 |2024.09|🔥🔥[**TEE**]Confidential Computing on nVIDIA H100 GPU: A Performance Benchmark Study(@phala.network)|[[pdf]](https://arxiv.org/pdf/2409.03992)|⚠️|⭐️ |
 |2024.09|🔥🔥[**HiFloat8**] Ascend HiFloat8 Format for Deep Learning(@Huawei)|[[pdf]](https://arxiv.org/pdf/2409.16626)|⚠️|⭐️ |
 |2024.09|🔥🔥[**Tensor Cores**] Efficient Arbitrary Precision Acceleration for Large Language Models on GPU Tensor Cores(@nju.edu.cn)|[[pdf]](https://arxiv.org/pdf/2409.17870)|⚠️|⭐️ |
+|2024.07|🔥🔥[**Tensor Product**] Acceleration of Tensor-Product Operations with Tensor Cores(@Heidelberg University)|[[pdf]](https://arxiv.org/pdf/2407.09621)|⚠️|⭐️ |
 
 ### 📖VLM/Position Embed/Others ([©️back👆🏻](#paperlist))
 <div id="Others"></div>

0 commit comments
