From 087560c7899bc5b0c4a08ef0725b437dede2d239 Mon Sep 17 00:00:00 2001 From: Jinuk Kim Date: Mon, 9 Jun 2025 03:06:33 +0900 Subject: [PATCH 1/2] Update README.md --- README.md | 4 ++++ 1 file changed, 4 insertions(+) diff --git a/README.md b/README.md index 5be32b5..86894ca 100644 --- a/README.md +++ b/README.md @@ -109,6 +109,10 @@ Amir Gholami\* , Sehoon Kim\* , Zhen Dong\* , Zhewei Yao\* , Michael W. Mahoney, ---- +### 2025 + +- [[ICML](https://arxiv.org/abs/2505.07004)] GuidedQuant: Large Language Model Quantization via Exploiting End Loss Guidance [[code](https://github.com/snu-mllab/GuidedQuant)]![GitHub Repo stars](https://img.shields.io/github/stars/snu-mllab/GuidedQuant) + ### 2024 - [[TMLR](https://openreview.net/pdf?id=IEKtMMSblm)] PLUM: Improving Inference Efficiency By Leveraging Repetition-Sparsity Trade-Off [[code](https://github.com/sachitkuhar/PLUM)][[webpage](https://github.com/sachitkuhar/PLUM)][[video](https://www.youtube.com/watch?v=nE_CYDWqQ_I)][**`bnn`**] [**`inference`**] From f473b7f8f8d7d75366669ff00c35045dff46fb28 Mon Sep 17 00:00:00 2001 From: Jinuk Kim Date: Mon, 9 Jun 2025 03:08:17 +0900 Subject: [PATCH 2/2] Update README.md --- README.md | 1 + 1 file changed, 1 insertion(+) diff --git a/README.md b/README.md index 86894ca..9ead8c8 100644 --- a/README.md +++ b/README.md @@ -12,6 +12,7 @@ This repo collects papers, documents, and codes about model quantization for any - [Survey\_of\_Binarization](#survey_of_binarization) - [Survey\_of\_Quantization](#survey_of_quantization) - [Papers](#papers) + - [2025](#2025) - [2024](#2024) - [2023](#2023) - [2022](#2022)