Commit 0e6e0f7

update threshold
Signed-off-by: Yuki Huang <yukih@nvidia.com>
1 parent 4b020fc commit 0e6e0f7

1 file changed: +2 -3 lines changed
tests/functional/grpo_multiturn.sh

Lines changed: 2 additions & 3 deletions
@@ -29,7 +29,7 @@ uv run coverage run -a --data-file=$PROJECT_ROOT/tests/.coverage --source=$PROJE
 policy.max_total_sequence_length=1024 \
 policy.train_global_batch_size=4 \
 policy.train_micro_batch_size=1 \
-policy.generation.top_p=0.99 \
+policy.generation.top_p=0.9 \
 policy.generation.top_k=8000 \
 logger.tensorboard_enabled=true \
 logger.log_dir=$LOG_DIR \
@@ -41,5 +41,4 @@ uv run coverage run -a --data-file=$PROJECT_ROOT/tests/.coverage --source=$PROJE
 uv run tests/json_dump_tb_logs.py $LOG_DIR --output_path $JSON_METRICS
 
 uv run tests/check_metrics.py $JSON_METRICS \
-'max(data["train/token_mult_prob_error"]) < 1.1'
-
+'median(data["train/token_mult_prob_error"]) < 1.1'
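
The switch from `max` to `median` in the check above can be illustrated with a minimal sketch. It assumes the metric is a plain list of per-step floats. The values and the `passes` helper are hypothetical and are not taken from `tests/check_metrics.py`, whose internals this commit does not show.

```python
from statistics import median

# Hypothetical per-step values for train/token_mult_prob_error (illustrative only).
errors = [1.02, 1.03, 1.01, 1.45, 1.02]


def passes(aggregate, values, threshold=1.1):
    """Return True when the aggregated metric is strictly below the threshold."""
    return aggregate(values) < threshold


# With max, a single outlier step (1.45) is enough to fail the check.
max_ok = passes(max, errors)

# With median, the check reflects the typical step and tolerates one spike.
median_ok = passes(median, errors)

print(max_ok, median_ok)  # False True
```

Under these assumed values the `max` form fails on one spike while the `median` form passes, which is the kind of flakiness the change appears intended to avoid.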