Question about the draft/target memory ratio

Thanks for your interesting work,  I believe that the project provides new theoretical analysis and insights about speculative decoding.

I would like to ask a question about the draft/target memory ratio.  The paper shows that "the draft models can occupy up to 38∼140% memory footprint of target models", but I didn't find any equation related to this. I wanna to know how do you analysis it theoretically? Could you provide a specific equation?
![微信截图_20241120164006](https://github.com/user-attachments/assets/e78e075a-eb8d-47af-8110-ccb8c3a642c6)




Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Question about the draft/target memory ratio #5

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Question about the draft/target memory ratio #5

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions