-
Notifications
You must be signed in to change notification settings - Fork 2k
Open
Labels
Speculative Decoding<NV>MTP/Eagle/Medusa/Lookahead/Prompt-Lookup-Decoding/Draft-Target-Model/ReDrafter<NV>MTP/Eagle/Medusa/Lookahead/Prompt-Lookup-Decoding/Draft-Target-Model/ReDrafterbugSomething isn't workingSomething isn't working
Description
System Info
There currently is not any support for MTP like DeepSeek has for Qwen3 next
Who can help?
No response
Information
- The official example scripts
- My own modified scripts
Tasks
- An officially supported task in the
examplesfolder (such as GLUE/SQuAD, ...) - My own task or dataset (give details below)
Reproduction
Attempt to run Qwen3 Next like the example but with MTP
Expected behavior
Support like DeepSeek has
actual behavior
Super slow inference due to the fact that there's no MTP speculative decoding support for Qwen3 Next
additional notes
| class DeepseekV3MTP(DeepseekV3DecoderLayer): |
Before submitting a new issue...
- Make sure you already searched for relevant issues, and checked the documentation and examples for answers to frequently asked questions.
coderabbitai
Metadata
Metadata
Assignees
Labels
Speculative Decoding<NV>MTP/Eagle/Medusa/Lookahead/Prompt-Lookup-Decoding/Draft-Target-Model/ReDrafter<NV>MTP/Eagle/Medusa/Lookahead/Prompt-Lookup-Decoding/Draft-Target-Model/ReDrafterbugSomething isn't workingSomething isn't working