Skip to content

Commit dd70039

Browse files
committed
2 parents c5676ff + d577359 commit dd70039

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -72,7 +72,7 @@ The flow tested in `speculative_decoding_v0.py`:
7272
This “partial verification” trims the work the main decoder performs after each divergence, while keeping the responsiveness of the draft hypothesis. Early timing experiments from `speculative_decoding_v0.py` show the verification pass (~0.15 s in the example) is significantly cheaper than recomputing a full decoding step every time.
7373

7474
<p align="center">
75-
<img src="partial_speculative_decoding.png"width="730">
75+
<img src="https://raw.githubusercontent.com/QuentinFuxa/NoLanguageLeftWaiting/05b8d868cc74a3f14c67e35bfbe460d8ff78d512/partial_speculative_decoding.png"width="730">
7676
</p>
7777

7878

0 commit comments

Comments
 (0)