Skip to content

Commit 143205a

Browse files
IzzyPuttermanvideodanchik
authored andcommitted
[None][feat] Eagle: MLA Based Eagle (NVIDIA#9677)
Signed-off-by: Izzy Putterman <iputterman@nvidia.com> Signed-off-by: Daniil Kulko <kulkodaniil@gmail.com>
1 parent db17afd commit 143205a

File tree

5 files changed

+321
-68
lines changed

5 files changed

+321
-68
lines changed

tensorrt_llm/_torch/models/modeling_deepseekv3.py

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -1377,13 +1377,13 @@ def _run_MoE(hidden_states, hidden_states_fp4, do_finalize):
13771377
hidden_states, residual = self.moe_allreduce(
13781378
fc2_output, all_reduce_params=moe_all_reduce_params)
13791379
else:
1380-
if self.next_layer_layernorm is not None:
1381-
hidden_states, residual = self.next_layer_layernorm(
1382-
hidden_states, residual)
13831380
if spec_metadata is not None and spec_metadata.is_layer_capture(
13841381
self.layer_idx):
13851382
spec_metadata.maybe_capture_hidden_states(
1386-
self.layer_idx, hidden_states, None)
1383+
self.layer_idx, hidden_states, residual)
1384+
if self.next_layer_layernorm is not None:
1385+
hidden_states, residual = self.next_layer_layernorm(
1386+
hidden_states, residual)
13871387

13881388
return hidden_states, residual
13891389

0 commit comments

Comments
 (0)