Skip to content

Commit c38f11e

Browse files
authored
Update README.md
1 parent 2024206 commit c38f11e

File tree

1 file changed

+9
-7
lines changed

1 file changed

+9
-7
lines changed

README.md

Lines changed: 9 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@ The fastest off-the-shelf inference algorithm for LLMs
77
</h3>
88

99
<p align="center">
10-
| <a href="https://arxiv.org/abs/2405.14105"><b>Paper</b></a> |
10+
| <a href="https://openreview.net/forum?id=cJd1BgZ9CS"><b>Paper</b></a> (ICLR'25) |
1111
</p>
1212

1313
---
@@ -116,12 +116,14 @@ Our efforts and resources are supported by the following organizations. Thank yo
116116

117117
## Citation
118118

119-
If you use DSI (or the code in this repo) for your research, please cite our [paper](https://arxiv.org/abs/2405.14105):
119+
If you use DSI (or the code in this repo) for your research, please cite our paper (https://openreview.net/forum?id=cJd1BgZ9CS):
120120
```bibtex
121-
@article{timor2024distributed,
122-
title={Distributed Speculative Inference of Large Language Models},
123-
author={Timor, Nadav and Mamou, Jonathan and Korat, Daniel and Berchansky, Moshe and Pereg, Oren and Wasserblat, Moshe and Galanti, Tomer and Gordon, Michal and Harel, David},
124-
journal={arXiv preprint arXiv:2405.14105},
125-
year={2024}
121+
@inproceedings{
122+
timor2025dsi,
123+
title={{DSI}: Faster Inference of Large Language Models via Speculation Parallelism},
124+
author={Nadav Timor and Jonathan Mamou and Daniel Korat and Moshe Berchansky and Oren Pereg and Moshe Wasserblat and Tomer Galanti and Michal Gordon-Kiwkowitz and David Harel},
125+
booktitle={The Thirteenth International Conference on Learning Representations},
126+
year={2025},
127+
url={https://openreview.net/forum?id=cJd1BgZ9CS}
126128
}
127129
```

0 commit comments

Comments
 (0)