Commit 7076670

feat: efficient eagle3 with cross attn and flex attn
Signed-off-by: h-guo18 <[email protected]>
1 parent 26c203a commit 7076670

2 files changed, 88 additions and 307 deletions

examples/speculative_decoding/main.py

Lines changed: 2 additions & 0 deletions
@@ -203,6 +203,8 @@ def train():
             "draft_vocab_size": custom_config["draft_vocab_size"]
             if eagle_args.eagle_config and "draft_vocab_size" in custom_config
             else model.config.vocab_size,
+            # pass in the seq length for flex attention mask compilation
+            "training_seq_len": training_args.training_seq_len,
         }
     )
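For readers skimming the hunk, the dictionary it builds can be sketched in isolation. This is a minimal illustration, not code from the commit: `build_draft_overrides` and its parameters are hypothetical names, and passing `None` for `custom_config` stands in for the case where `eagle_args.eagle_config` is not set.

```python
from typing import Any, Optional


def build_draft_overrides(
    custom_config: Optional[dict[str, Any]],
    base_vocab_size: int,
    training_seq_len: int,
) -> dict[str, Any]:
    """Hypothetical helper mirroring the dict assembled in the hunk above."""
    # Prefer the draft vocabulary size from the custom config when present;
    # otherwise fall back to the base model's vocabulary size.
    if custom_config and "draft_vocab_size" in custom_config:
        draft_vocab_size = custom_config["draft_vocab_size"]
    else:
        draft_vocab_size = base_vocab_size
    return {
        "draft_vocab_size": draft_vocab_size,
        # Added by this commit: forward the training sequence length so the
        # flex attention mask can be compiled for a known size.
        "training_seq_len": training_seq_len,
    }


if __name__ == "__main__":
    # Illustrative values only; they do not come from the commit.
    print(build_draft_overrides({"draft_vocab_size": 32000}, 128256, 2048))
    print(build_draft_overrides(None, 128256, 2048))
```

The fallback keeps the draft model's vocabulary aligned with the base model unless a custom config overrides it, while `training_seq_len` is always forwarded.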
