Skip to content

Commit 0374c25

Browse files
committed
doc fix
1 parent d816788 commit 0374c25

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

examples/dpo_humanlike/train_dpo.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -26,7 +26,7 @@ actor_rollout_ref:
2626
min_lr_ratio: 0.1 # only useful for warmup with cosine
2727
warmup_style: cosine # select from constant/cosine
2828
total_training_steps: 783 #
29-
betas: [0.9, 0.95] # set to smaller value for scenarios with abrupt distribution shift (e.g., large sync_interval)
29+
betas: [0.9, 0.95]
3030
fsdp_config:
3131
wrap_policy:
3232
# transformer_layer_cls_to_wrap: None

0 commit comments

Comments
 (0)