Commit 164c8c0

update
1 parent 95e4376 commit 164c8c0

2 files changed: 6 additions & 6 deletions

index.html

Lines changed: 5 additions & 5 deletions
@@ -327,11 +327,11 @@ <h2 class="title is-3"><img src="static/images/Telescope.png" style="height: 50p
 <section class="section" id="BibTeX">
 <div class="container is-max-desktop content">
 <h2 class="title">BibTeX</h2>
-<pre><code>@article{sun2024TriForce,
-title={TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding},
-author={Sun, Hanshi and Chen, Zhuoming and Yang, Xinyu and Tian, Yuandong and Chen, Beidi},
-journal={arXiv preprint arXiv:2404.11912},
-year={2024}
+<pre><code>@article{sun2024triforce,
+title={Triforce: Lossless acceleration of long sequence generation with hierarchical speculative decoding},
+author={Sun, Hanshi and Chen, Zhuoming and Yang, Xinyu and Tian, Yuandong and Chen, Beidi},
+journal={arXiv preprint arXiv:2404.11912},
+year={2024}
 }</code></pre>
 </div>
 </section>

models/modeling_llama_68m.py

Lines changed: 1 addition & 1 deletion
@@ -156,7 +156,7 @@ def forward(
 query_states = query_states.transpose(1, 2)
 key_states = key_states.transpose(1, 2)

-position_ids = torch.arange(graph_cache.real_budget-graph_cache.gamma-1, graph_cache.real_budget-graph_cache.gamma+gamma_offset, device=position_ids.device).unsqueeze(0)
+position_ids = torch.arange(graph_cache.real_budget-graph_cache.gamma-3, graph_cache.real_budget-graph_cache.gamma+gamma_offset-2, device=position_ids.device).unsqueeze(0)
 query_states = apply_rotary_pos_emb_single(query_states, cos, sin, position_ids)
 key_position_ids = torch.arange(kv_seq_len, device=position_ids.device).unsqueeze(0)
 key_states = apply_rotary_pos_emb_single(key_states, cos, sin, key_position_ids)
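
Note on the `position_ids` change in `models/modeling_llama_68m.py`: both bounds of the `torch.arange` call move down by 2, so the query positions keep the same count (`gamma_offset + 1`) and are only shifted two slots earlier. The sketch below illustrates that arithmetic with Python's built-in `range` standing in for `torch.arange`. The helper `query_positions` and the numeric values are hypothetical, chosen only for illustration; they are not taken from the repository.

```python
def query_positions(real_budget, gamma, gamma_offset, shift=0):
    """Positions produced by the arange bounds in the diff, as a plain list.

    shift=0 reproduces the removed line, shift=2 the added line.
    (Hypothetical helper; the real code builds a torch tensor instead.)
    """
    start = real_budget - gamma - 1 - shift
    stop = real_budget - gamma + gamma_offset - shift
    return list(range(start, stop))


# Hypothetical values, not taken from the repository.
real_budget, gamma, gamma_offset = 264, 6, 2

old = query_positions(real_budget, gamma, gamma_offset, shift=0)
new = query_positions(real_budget, gamma, gamma_offset, shift=2)

print(old)  # [257, 258, 259]
print(new)  # [255, 256, 257]
```

Under these assumed values, the window length is unchanged and every position is exactly 2 lower after the commit. The commit message ("update") does not state why the shift was needed.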
