File tree Expand file tree Collapse file tree 1 file changed +1
-0
lines changed
Expand file tree Collapse file tree 1 file changed +1
-0
lines changed Original file line number Diff line number Diff line change @@ -367,6 +367,7 @@ Awesome-LLM-Inference: A curated list of [📙Awesome LLM Inference Papers with
367367| 2023.05| 🔥🔥[ ** RWKV** ] RWKV: Reinventing RNNs for the Transformer Era(@Bo Peng etc) | [[ pdf]] ( https://arxiv.org/pdf/2305.13048.pdf ) | [[ RWKV-LM]] ( https://github.com/BlinkDL/RWKV-LM ) ![ ] ( https://img.shields.io/github/stars/BlinkDL/RWKV-LM.svg?style=social ) | ⭐️⭐️ |
368368| 2023.12| 🔥🔥[ ** Mamba** ] Mamba: Linear-Time Sequence Modeling with Selective State Spaces(@cs .cmu.edu etc) | [[ pdf]] ( https://arxiv.org/pdf/2312.00752.pdf ) | [[ mamba]] ( https://github.com/state-spaces/mamba ) ![ ] ( https://img.shields.io/github/stars/state-spaces/mamba.svg?style=social ) | ⭐️⭐️ |
369369| 2024.06| 🔥🔥[ ** RWKV-CLIP** ] RWKV-CLIP: A Robust Vision-Language Representation Learner(@DeepGlint etc) | [[ pdf]] ( https://arxiv.org/pdf/2406.06973 ) | [[ RWKV-CLIP]] ( https://github.com/deepglint/RWKV-CLIP ) ![ ] ( https://img.shields.io/github/stars/deepglint/RWKV-CLIP.svg?style=social ) | ⭐️⭐️ |
370+ | 2024.08| [ Kraken] Kraken: Inherently Parallel Transformers For Efficient Multi-Device Inference(@Princeton ) | [[ pdf]] ( https://arxiv.org/pdf/2408.07802 ) | ⚠️| ⭐️ |
370371
371372### 📖GEMM/Tensor Cores/WMMA/Parallel ([ ©️back👆🏻] ( #paperlist ) )
372373<div id =" GEMM-Tensor-Cores-WMMA " ></div >
You can’t perform that action at this time.
0 commit comments