Skip to content

Commit d763771

Browse files
authored
Merge branch 'main' into akoumparouli/fix_re_enabled_megatron_fsdp
2 parents bd66859 + f942ec3 commit d763771

File tree

69 files changed

+4767
-1930
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

69 files changed

+4767
-1930
lines changed

docs/model-coverage/vlm.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -25,6 +25,7 @@ NeMo Automodel supports [AutoModelForImageTextToText](https://huggingface.co/doc
2525

2626
| Model | Dataset | FSDP2 | PEFT | Example YAML |
2727
|------------------------------------|-----------------------------|------------|------------|--------------|
28+
| Kimi-VL-A3B-Instruct | cord-v2 | Supported | Supported | [kimi2vl_cordv2.yaml](../../examples/vlm_finetune/kimi/kimi2vl_cordv2.yaml) |
2829
| Gemma 3-4B & 27B | naver-clova-ix & rdr-items | Supported | Supported | [gemma3_vl_4b_cord_v2.yaml](../../examples/vlm_finetune/gemma3/gemma3_vl_4b_cord_v2.yaml) |
2930
| Gemma 3n | naver-clova-ix & rdr-items | Supported | Supported | [gemma3n_vl_4b_medpix.yaml](../../examples/vlm_finetune/gemma3n/gemma3n_vl_4b_medpix.yaml) |
3031
| Qwen2-VL-2B-Instruct & Qwen2.5-VL-3B-Instruct | cord-v2 | Supported | Supported | [qwen2_5_vl_3b_rdr.yaml](../../examples/vlm_finetune/qwen2_5/qwen2_5_vl_3b_rdr.yaml) |

examples/benchmark/configs/deepseek_v3_te_deepep.yaml

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -80,7 +80,6 @@ model:
8080

8181
checkpoint:
8282
enabled: false
83-
load_base_model: false
8483

8584
loss_fn:
8685
_target_: nemo_automodel.components.loss.masked_ce.MaskedCrossEntropy

examples/benchmark/configs/deepseek_v3_te_deepep_1024.yaml

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -61,7 +61,6 @@ model:
6161

6262
checkpoint:
6363
enabled: false
64-
load_base_model: false
6564

6665
distributed:
6766
_target_: nemo_automodel.components.distributed.fsdp2.FSDP2Manager

examples/benchmark/configs/deepseek_v3_torch.yaml

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -76,7 +76,6 @@ model:
7676

7777
checkpoint:
7878
enabled: false
79-
load_base_model: false
8079

8180
loss_fn:
8281
_target_: nemo_automodel.components.loss.masked_ce.MaskedCrossEntropy

examples/benchmark/configs/glm_4.5_air_te_deepep.yaml

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -79,7 +79,6 @@ model:
7979

8080
checkpoint:
8181
enabled: false
82-
load_base_model: false
8382

8483
loss_fn:
8584
_target_: nemo_automodel.components.loss.masked_ce.MaskedCrossEntropy

examples/benchmark/configs/gptoss_120b_te_deepep.yaml

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -78,7 +78,6 @@ model:
7878

7979
checkpoint:
8080
enabled: false
81-
load_base_model: false
8281

8382
loss_fn:
8483
_target_: nemo_automodel.components.loss.masked_ce.MaskedCrossEntropy

examples/benchmark/configs/gptoss_20b_te_deepep.yaml

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -77,7 +77,6 @@ model:
7777

7878
checkpoint:
7979
enabled: false
80-
load_base_model: false
8180

8281
loss_fn:
8382
_target_: nemo_automodel.components.loss.masked_ce.MaskedCrossEntropy

examples/benchmark/configs/gptoss_20b_torch.yaml

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -76,7 +76,6 @@ model:
7676

7777
checkpoint:
7878
enabled: false
79-
load_base_model: false
8079

8180
loss_fn:
8281
_target_: nemo_automodel.components.loss.masked_ce.MaskedCrossEntropy

examples/benchmark/configs/kimi_k2_te_deepep.yaml

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -77,7 +77,6 @@ model:
7777

7878
checkpoint:
7979
enabled: false
80-
load_base_model: false
8180

8281
loss_fn:
8382
_target_: nemo_automodel.components.loss.masked_ce.MaskedCrossEntropy

examples/benchmark/configs/moonlight_16b_te_deepep.yaml

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -79,7 +79,6 @@ model:
7979

8080
checkpoint:
8181
enabled: false
82-
load_base_model: false
8382

8483
loss_fn:
8584
_target_: nemo_automodel.components.loss.masked_ce.MaskedCrossEntropy

0 commit comments

Comments
 (0)