Skip to content

Commit 02febf1

Browse files
authored
fix: add mask seq with high logp err to nemo gym config (#1980)
Signed-off-by: cmunley1 <cmunley@nvidia.com>
1 parent bdc967c commit 02febf1

File tree

1 file changed

+1
-0
lines changed

1 file changed

+1
-0
lines changed

examples/nemo_gym/grpo_workplace_assistant_nemotron_nano_v2_9b.yaml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -35,6 +35,7 @@ grpo:
3535
target_min: 0.0
3636
target_max: 1.0
3737
skip_reference_policy_logprobs_calculation: true
38+
seq_logprob_error_threshold: null
3839

3940
loss_fn:
4041
reference_policy_kl_penalty: 0

0 commit comments

Comments
 (0)