Commit 8938a09
committed
feat: efficient eagle3 with cross attn and flex attn
Signed-off-by: h-guo18 <[email protected]>1 parent be95a10 commit 8938a09
File tree
2 files changed
+88
-307
lines changed- examples/speculative_decoding
- modelopt/torch/speculative/plugins
2 files changed
+88
-307
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
182 | 182 | | |
183 | 183 | | |
184 | 184 | | |
| 185 | + | |
| 186 | + | |
185 | 187 | | |
186 | 188 | | |
187 | 189 | | |
| |||
0 commit comments