Commit db892e7
Mohammad Mahdi Kamani
Fix input layer norm mismatch for Eagle Speculative Decoding compatib… (ROCm#548)
* Fix input layer norm mismatch for Eagle Speculative Decoding compatibility
The LLaMA decoder layer applies input layer normalization at every layer, whereas Eagle omits it for the initial layer, using a dummy InputLayerNorm class instead.
Recently, LLaMA's input layer norm implementation (https://github.com/ROCm/vllm/blob/262ed1e16c5bd71f0612b700186854b8c932565d/vllm/model_executor/models/llama.py#L326) was updated to accept at most 3 inputs. To maintain compatibility and prevent Eagle Speculative Decoding from failing, this dummy class needs to be updated accordingly.
* Update eagle.py1 parent d1d3ff9 commit db892e7
1 file changed
+1
-1
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
29 | 29 | | |
30 | 30 | | |
31 | 31 | | |
32 | | - | |
| 32 | + | |
33 | 33 | | |
34 | 34 | | |
35 | 35 | | |
| |||
0 commit comments