Skip to content

Commit 8f1a3a6

Browse files
committed
Add Eagle3 config support for auxiliary hidden state layer IDs
Support configuring eagle_aux_hidden_state_layer_ids and inference_type in the Eagle3 speculator configuration. This allows users to specify which verifier layers should output auxiliary hidden states for the drafter to consume during speculative decoding. Signed-off-by: rahul-tuli <[email protected]> Signed-off-by: Rahul Tuli <[email protected]>
1 parent 1ad3aca commit 8f1a3a6

File tree

1 file changed

+5
-0
lines changed
  • vllm/transformers_utils/configs/speculators

1 file changed

+5
-0
lines changed

vllm/transformers_utils/configs/speculators/algos.py

Lines changed: 5 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -30,3 +30,8 @@ def update_eagle3(config_dict: dict, vllm_config: dict) -> None:
3030
vllm_config["norm_before_residual"] = config_dict.get(
3131
"norm_before_residual", True)
3232
vllm_config["architectures"] = ["Eagle3LlamaForCausalLM"]
33+
if config_dict.get("eagle_aux_hidden_state_layer_ids"):
34+
vllm_config["eagle_aux_hidden_state_layer_ids"] = config_dict[
35+
"eagle_aux_hidden_state_layer_ids"]
36+
if config_dict.get("inference_type"):
37+
vllm_config["inference_type"] = config_dict["inference_type"]

0 commit comments

Comments
 (0)