Commit dfc3624

fix
Signed-off-by: jianh <jianh@nvidia.com>
1 parent 4c331ed commit dfc3624

1 file changed: +1 -1 lines changed

examples/configs/grpo_math_1B.yaml

Lines changed: 1 addition & 1 deletion
@@ -61,7 +61,7 @@ loss_fn:
   use_importance_sampling_correction: false
   truncated_importance_sampling_ratio: null
   truncated_importance_sampling_ratio_min: null # Lower bound for ICE-POP
-  truncated_importance_sampling_type: null # "tis" (clamp to max) or "icepop" (filter outside [min, max])
+  truncated_importance_sampling_type: tis # "tis" (clamp to max) or "icepop" (filter outside [min, max])
   sequence_level_importance_ratios: false
   token_level_loss: true
   force_on_policy_ratio: false # Set to true to force ratio=1.0 (requires train_global_batch_size == num_prompts_per_step * num_generations_per_prompt)
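
The inline comment on `truncated_importance_sampling_type` names two modes: "tis" clamps each importance ratio to a maximum, and "icepop" filters out ratios that fall outside `[min, max]`. The sketch below illustrates that distinction only. It is not the repository's implementation: the helper name `apply_truncated_is`, its parameters, and the choice to represent "filtered" tokens with a weight of 0.0 are all assumptions made for illustration.

```python
from typing import List, Optional


def apply_truncated_is(
    ratios: List[float],
    mode: Optional[str],
    ratio_max: float,
    ratio_min: Optional[float] = None,
) -> List[float]:
    """Hypothetical helper: adjust per-token importance ratios.

    mode=None     -> ratios are returned unchanged
    mode="tis"    -> each ratio is clamped to at most ratio_max
    mode="icepop" -> ratios outside [ratio_min, ratio_max] get weight 0.0
                     (assumption: "filter" means the token is masked out)
    """
    if mode is None:
        return list(ratios)
    if mode == "tis":
        return [min(r, ratio_max) for r in ratios]
    if mode == "icepop":
        if ratio_min is None:
            raise ValueError("icepop needs a lower bound (ratio_min)")
        return [r if ratio_min <= r <= ratio_max else 0.0 for r in ratios]
    raise ValueError(f"unknown truncation mode: {mode!r}")


# Example ratios: one small, one on-policy, one large.
example = [0.2, 1.0, 3.5]
clamped = apply_truncated_is(example, "tis", ratio_max=2.0)
filtered = apply_truncated_is(example, "icepop", ratio_max=2.0, ratio_min=0.5)
```

Under these assumptions, "tis" keeps every token but caps the large ratio at the maximum, while "icepop" drops both the small and the large ratio and keeps only the one inside the band.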
