[Bugfix][Speculative Decoding] Fix Eagle3 quantization config inheritance by rahul-tuli · Pull Request #120 · neuralmagic/vllm

rahul-tuli · 2025-09-29T12:20:44Z

Eagle3 drafters were incorrectly inheriting the verifier's quantization
configuration instead of using their own, causing KeyError when loading
unquantized drafter weights with quantized verifiers.

This implements a clean inheritance pattern where:

Base LlamaDecoderLayer has configurable get_quant_config() method
Eagle3 LlamaDecoderLayer overrides to use drafter's quantization config
Uses existing VllmConfig.get_quantization_config() infrastructure

…ance Eagle3 drafters were incorrectly inheriting the verifier's quantization configuration instead of using their own, causing KeyError when loading unquantized drafter weights with quantized verifiers. This implements a clean inheritance pattern where: - Base LlamaDecoderLayer has configurable get_quant_config() method - Eagle3 LlamaDecoderLayer overrides to use drafter's quantization config - Uses existing VllmConfig._get_quantization_config() infrastructure Fixes speculative decoding with quantized verifier + unquantized drafter. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> Signed-off-by: rtuli@redhat.com Signed-off-by: Rahul Tuli <rtuli@redhat.com>

rahul-tuli · 2025-09-29T15:58:34Z

Landed on vllm main!

rahul-tuli closed this Sep 29, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Bugfix][Speculative Decoding] Fix Eagle3 quantization config inheritance#120

[Bugfix][Speculative Decoding] Fix Eagle3 quantization config inheritance#120
rahul-tuli wants to merge 1 commit intomainfrom
fix/eagle3-quantization-config

rahul-tuli commented Sep 29, 2025 •

edited by github-actions bot

Loading

Uh oh!

rahul-tuli commented Sep 29, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

rahul-tuli commented Sep 29, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rahul-tuli commented Sep 29, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

rahul-tuli commented Sep 29, 2025 •

edited by github-actions bot

Loading