Skip to content

Commit 6fa2b47

Browse files
authored
feat: transformers v4 API (#1116)
Signed-off-by: adil-a <adil.asif2000@hotmail.com> Signed-off-by: adi-a <adil.asif2000@hotmail.com>
1 parent d86b960 commit 6fa2b47

File tree

62 files changed

+2073
-1907
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

62 files changed

+2073
-1907
lines changed

examples/benchmark/configs/deepseek_v3_te_deepep.yaml

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -80,7 +80,6 @@ model:
8080

8181
checkpoint:
8282
enabled: false
83-
load_base_model: false
8483

8584
loss_fn:
8685
_target_: nemo_automodel.components.loss.masked_ce.MaskedCrossEntropy

examples/benchmark/configs/deepseek_v3_te_deepep_1024.yaml

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -61,7 +61,6 @@ model:
6161

6262
checkpoint:
6363
enabled: false
64-
load_base_model: false
6564

6665
distributed:
6766
_target_: nemo_automodel.components.distributed.fsdp2.FSDP2Manager

examples/benchmark/configs/deepseek_v3_torch.yaml

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -76,7 +76,6 @@ model:
7676

7777
checkpoint:
7878
enabled: false
79-
load_base_model: false
8079

8180
loss_fn:
8281
_target_: nemo_automodel.components.loss.masked_ce.MaskedCrossEntropy

examples/benchmark/configs/glm_4.5_air_te_deepep.yaml

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -79,7 +79,6 @@ model:
7979

8080
checkpoint:
8181
enabled: false
82-
load_base_model: false
8382

8483
loss_fn:
8584
_target_: nemo_automodel.components.loss.masked_ce.MaskedCrossEntropy

examples/benchmark/configs/gptoss_120b_te_deepep.yaml

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -78,7 +78,6 @@ model:
7878

7979
checkpoint:
8080
enabled: false
81-
load_base_model: false
8281

8382
loss_fn:
8483
_target_: nemo_automodel.components.loss.masked_ce.MaskedCrossEntropy

examples/benchmark/configs/gptoss_20b_te_deepep.yaml

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -77,7 +77,6 @@ model:
7777

7878
checkpoint:
7979
enabled: false
80-
load_base_model: false
8180

8281
loss_fn:
8382
_target_: nemo_automodel.components.loss.masked_ce.MaskedCrossEntropy

examples/benchmark/configs/gptoss_20b_torch.yaml

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -76,7 +76,6 @@ model:
7676

7777
checkpoint:
7878
enabled: false
79-
load_base_model: false
8079

8180
loss_fn:
8281
_target_: nemo_automodel.components.loss.masked_ce.MaskedCrossEntropy

examples/benchmark/configs/kimi_k2_te_deepep.yaml

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -77,7 +77,6 @@ model:
7777

7878
checkpoint:
7979
enabled: false
80-
load_base_model: false
8180

8281
loss_fn:
8382
_target_: nemo_automodel.components.loss.masked_ce.MaskedCrossEntropy

examples/benchmark/configs/moonlight_16b_te_deepep.yaml

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -79,7 +79,6 @@ model:
7979

8080
checkpoint:
8181
enabled: false
82-
load_base_model: false
8382

8483
loss_fn:
8584
_target_: nemo_automodel.components.loss.masked_ce.MaskedCrossEntropy

examples/benchmark/configs/moonlight_16b_torch.yaml

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -76,7 +76,6 @@ model:
7676

7777
checkpoint:
7878
enabled: false
79-
load_base_model: false
8079

8180
loss_fn:
8281
_target_: nemo_automodel.components.loss.masked_ce.MaskedCrossEntropy

0 commit comments

Comments
 (0)