Skip to content

Commit c4ee535

Browse files
authored
[fix] fix eagle3 two model disaggregated serving test (NVIDIA#6014)
Signed-off-by: Iman Tabrizian <10105175+tabrizian@users.noreply.github.com>
1 parent 6d4b045 commit c4ee535

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

tests/integration/defs/accuracy/test_disaggregated_serving.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -262,7 +262,7 @@ def test_eagle3(self, overlap_scheduler):
262262
speculative_decoding_config = {
263263
"decoding_type": "Eagle",
264264
"max_draft_len": 4,
265-
"pytorch_weights_path":
265+
"speculative_model_dir":
266266
f"{llm_models_root()}/EAGLE3-LLaMA3.1-Instruct-8B",
267267
"eagle3_one_model": False
268268
}

0 commit comments

Comments
 (0)