Skip to content

Commit 025f894

Browse files
committed
update website
1 parent 6ec5080 commit 025f894

File tree

1 file changed

+8
-0
lines changed

1 file changed

+8
-0
lines changed

docs/index.html

Lines changed: 8 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -272,6 +272,14 @@ <h2 class="title is-4">Insight: Information Disparity in Pre-training vs. Fine-t
272272
This addresses the <b>serving challenge</b>.
273273
</p>
274274

275+
<p style="margin-bottom: 20px;">
276+
Past work (GPT-Zip, DeltaZip) has also explored quantization of the weight delta, achieving
277+
quantization levels as low as 2-bits by applying methods introduced by GPTQ. We find that
278+
the weight delta is extremely compressible, and are able to achieve <b>1-bit quantization</b>
279+
with minimal performance degradation using a simpler methodology.
280+
</p>
281+
282+
275283
<h2 class="title is-4">BitDelta Overview</h2>
276284
<h2 class="title is-5">1-bit quantization</h2>
277285

0 commit comments

Comments
 (0)