Commit f5835f9
committed
feat: efficient eagle3 with cross attn and flex attn
Signed-off-by: h-guo18 <[email protected]>1 parent aa0ac68 commit f5835f9
File tree
2 files changed
+88
-307
lines changed- examples/speculative_decoding
- modelopt/torch/speculative/plugins
2 files changed
+88
-307
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
203 | 203 | | |
204 | 204 | | |
205 | 205 | | |
| 206 | + | |
| 207 | + | |
206 | 208 | | |
207 | 209 | | |
208 | 210 | | |
| |||
0 commit comments