Skip to content

Commit 91ff9a1

Browse files
committed
missed config update change
Signed-off-by: Maanu Grover <[email protected]>
1 parent 118af8d commit 91ff9a1

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

nemo/tron/checkpointing.py

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1132,8 +1132,8 @@ def load_checkpoint(
11321132
run_tp_pp = (
11331133
cfg.model_config.tensor_model_parallel_size,
11341134
cfg.model_config.pipeline_model_parallel_size,
1135-
cfg.dist_config.encoder_tensor_model_parallel_size,
1136-
cfg.dist_config.encoder_pipeline_model_parallel_size,
1135+
getattr(cfg.model_config, "encoder_tensor_model_parallel_size", 0),
1136+
getattr(cfg.model_config, "encoder_pipeline_model_parallel_size", 0),
11371137
)
11381138
mismatch_msg = "(TP, PP, encoder TP, encoder PP) mismatch after resume ({} vs {} from checkpoint)".format(
11391139
run_tp_pp, ckpt_tp_pp

0 commit comments

Comments
 (0)