Skip to content

Commit c2b36f2

Browse files
authored
fix: grpo-llama3.1-8b-instruct-1n8g-megatron-fp8-rollouts runs 40 steps (#1231)
Signed-off-by: Terry Kong <terryk@nvidia.com>
1 parent 17ea9ab commit c2b36f2

File tree

3 files changed

+5
-5
lines changed

3 files changed

+5
-5
lines changed

examples/configs/recipes/llm/grpo-llama3.1-8b-instruct-1n8g-megatron-fp8-rollouts.yaml renamed to examples/configs/recipes/llm/grpo-llama3.1-8b-instruct-1n8g-megatron-fp8-rollouts.v2.yaml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -42,11 +42,11 @@ policy:
4242
data:
4343
max_input_seq_length: 4096
4444
logger:
45-
log_dir: logs/grpo-llama3.1-8b-instruct-1n8g-megatron-fp8-rollouts
45+
log_dir: logs/grpo-llama3.1-8b-instruct-1n8g-megatron-fp8-rollouts.v2
4646
wandb_enabled: true
4747
tensorboard_enabled: true
4848
wandb:
4949
project: nemo-rl
50-
name: grpo-llama3.1-8b-instruct-1n8g-megatron-fp8-rollouts
50+
name: grpo-llama3.1-8b-instruct-1n8g-megatron-fp8-rollouts.v2
5151
cluster:
5252
gpus_per_node: 8

tests/test_suites/llm/grpo-llama3.1-8b-instruct-1n8g-megatron-fp8-rollouts.sh renamed to tests/test_suites/llm/grpo-llama3.1-8b-instruct-1n8g-megatron-fp8-rollouts.v2.sh

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -4,8 +4,8 @@ source $SCRIPT_DIR/common.env
44

55
# ===== BEGIN CONFIG =====
66
NUM_NODES=1
7-
STEPS_PER_RUN=100
8-
MAX_STEPS=100
7+
STEPS_PER_RUN=40
8+
MAX_STEPS=40
99
NUM_RUNS=$(( (MAX_STEPS + STEPS_PER_RUN - 1) / STEPS_PER_RUN )) # Round up
1010
NUM_MINUTES=120
1111
# ===== END CONFIG =====

tests/test_suites/nightly.txt

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -35,7 +35,7 @@ tests/test_suites/llm/grpo-gspo-deepscaler-1.5b-8K.sh
3535
tests/test_suites/llm/grpo-math-qwen3-30ba3b-megatron-tp4-32k.sh
3636

3737
# FP8
38-
tests/test_suites/llm/grpo-llama3.1-8b-instruct-1n8g-megatron-fp8-rollouts.sh
38+
tests/test_suites/llm/grpo-llama3.1-8b-instruct-1n8g-megatron-fp8-rollouts.v2.sh
3939
tests/test_suites/llm/grpo-llama3.1-8b-instruct-1n8g-megatron-fp8-e2e.sh
4040

4141
# Non-colocated

0 commit comments

Comments
 (0)