Skip to content

Commit fca0a70

Browse files
committed
Update priority function name
1 parent 05004b7 commit fca0a70

File tree

2 files changed

+2
-2
lines changed

2 files changed

+2
-2
lines changed

examples/ppo_countdown_exp_replay/README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -29,7 +29,7 @@ Important config parameters for experience replay include:
2929
* `priority_fn_args`: additional args for the priority function
3030
* `synchronizer.sync_style`: set to `dynamic_by_explorer`, which allows the trainer to run more training steps as long as the priority queue buffer is non-empty
3131

32-
The priority function used in this example is named `linear_decay_use_count_control_randomization`.
32+
The priority function used in this example is named `decay_limit_randomization`.
3333
The logic behind it:
3434
* Priority score is calculated as `model_version - decay * use_count`, i.e., fresher and less used samples are prioritized;
3535
* If `sigma` is non-zero, priority score is further perturbed by random Gaussian noise with standard deviation `sigma`;

examples/ppo_countdown_exp_replay/countdown.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -36,7 +36,7 @@ buffer:
3636
replay_buffer:
3737
enable: true
3838
reuse_cooldown_time: 40
39-
priority_fn: linear_decay_use_count_control_randomization
39+
priority_fn: decay_limit_randomization
4040
# priority_fn_args: use default values
4141
explorer:
4242
eval_interval: 100

0 commit comments

Comments
 (0)