Skip to content

Commit 38cbd5e

Browse files
committed
--
1 parent 1787b5a commit 38cbd5e

File tree

2 files changed

+12
-11
lines changed

2 files changed

+12
-11
lines changed

imgs/rsr.png

507 KB
Loading

index.htm

Lines changed: 12 additions & 11 deletions
Original file line numberDiff line numberDiff line change
@@ -118,7 +118,7 @@
118118
Born from high-end research, Needle is designed to be accessible to everyone while delivering top-notch performance.
119119
Whether you are a researcher, developer, or an enthusiast, Needle opens up innovative ways to explore your image datasets. ✨
120120
<div class="project-links">
121-
<a href="https://www.cs.uic.edu/~indexlab/Needle/" class="btn project-btn" target="_blank">
121+
<a href="https://www.cs.uic.edu/~indexlab/Needle" class="btn project-btn" target="_blank">
122122
🌐 Visit Project
123123
</a>
124124
<a href="https://github.com/UIC-InDeXLab/Needle" class="btn github-btn" target="_blank">
@@ -138,16 +138,17 @@
138138
</div>
139139
<hr/>
140140
<div class="row no-gutters">
141-
<!--<div class="col-md-3 text-center">
141+
<div class="col-md-3 text-center">
142142
<p></p>
143-
<img width="90%" src="imgs/needlelogo.png"/>
144-
</div>-->
145-
<div class="card-body">
143+
<img width="90%" src="imgs/rsr.png"/>
144+
</div>
145+
<div class="card-body col-md-9">
146146
<h6><b>RSR 🧮</b>: Efficient Matrix Multiplication for Accelerating Inference in Binary and Ternary Neural Networks</h6>
147-
The codebase provides ready-to-use <b>NumPy</b>-based, <b>Torch</b>-based, and native <b>C++</b> implementations of our <i>RSR</i> and <i>RSR++</i> algorithms.
148-
It contains codes for the inference on CPU and GPU and sample experiments on `1.58bit` models, including
149-
<a href="https://huggingface.co/HF1BitLLM/Llama3-8B-1.58-100B-tokens" target="_blank">[`Llama3-8B-1.58bit`]</a>, <a href="https://huggingface.co/tiiuae/Falcon3-10B-Instruct-1.58bit" target="_blank">[`Falcon3-10B-1.58bit`]</a>, and <a href="https://huggingface.co/tiiuae/Falcon3-3B-Instruct-1.58bit" target="_blank">[`Falcon3-3B-1.58bit`]</a>.
150-
147+
This project aims to provide a fast and efficient approach to low-bit matrix multiplication.
148+
The code repository implements Redundant Segment Reduction (RSR), a fast matrix multiplication algorithm designed for matrices in binary and ternary networks.
149+
The RSR method optimizes computation efficiency by a log(n) factor, making it particularly useful for applications in low-bit deep learning and efficient inference.
150+
The codebase provides ready-to-use <b>C++</b> and <b>NumPy</b>-based implementations, as well as <b>PyTorch</b> implementations with both <b>CPU</b> and <b>GPU</b> support, enabling scalable and optimized matrix operations in deep learning environments.
151+
It includes sample experiments on various `1.58bit` models and LLMs.✨
151152
<div class="project-links">
152153
<a href="https://github.com/UIC-InDeXLab/RSR/" class="btn github-btn" target="_blank">
153154
🛠️ View on GitHub
@@ -156,8 +157,8 @@ <h6><b>RSR 🧮</b>: Efficient Matrix Multiplication for Accelerating Inference
156157
📄 Read Publication (Preprint)
157158
</a>
158159
</div>
159-
</div>
160-
</div>
160+
</div>
161+
</div>
161162
</div>
162163
</div>
163164

0 commit comments

Comments
 (0)