Commit 646d6d3

Fixed typo in PPO doc (#983)
* Fixed typo
  Fixed typo
* Update changelog
Co-authored-by: Antonin Raffin <[email protected]>
1 parent d532362 commit 646d6d3

2 files changed: 5 additions, 4 deletions

docs/misc/changelog.rst

Lines changed: 4 additions & 3 deletions
@@ -28,8 +28,9 @@ Others:
 
 Documentation:
 ^^^^^^^^^^^^^^
-- Fix typo in docstring "nature" -> "Nature" (@Melanol)
-- Add info on split tensorboard logs into (@Melanol)
+- Fixed typo in docstring "nature" -> "Nature" (@Melanol)
+- Added info on split tensorboard logs into (@Melanol)
+- Fixed typo in ppo doc (@francescoluciano)
 
 
 Release 1.6.0 (2022-07-11)
@@ -1014,4 +1015,4 @@ And all the contributors:
 @eleurent @ac-93 @cove9988 @theDebugger811 @hsuehch @Demetrio92 @thomasgubler @IperGiove @ScheiklP
 @simoninithomas @armandpl @manuel-delverme @Gautam-J @gianlucadecola @buoyancy99 @caburu @xy9485
 @Gregwar @ycheng517 @quantitative-technologies @bcollazo @git-thor @TibiGG @cool-RR @MWeltevrede
-@Melanol @qgallouedec
+@Melanol @qgallouedec @francescoluciano

docs/modules/ppo.rst

Lines changed: 1 addition & 1 deletion
@@ -8,7 +8,7 @@ PPO
 The `Proximal Policy Optimization <https://arxiv.org/abs/1707.06347>`_ algorithm combines ideas from A2C (having multiple workers)
 and TRPO (it uses a trust region to improve the actor).
 
-The main idea is that after an update, the new policy should be not too far form the old policy.
+The main idea is that after an update, the new policy should be not too far from the old policy.
 For that, ppo uses clipping to avoid too large update.
 
 
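For readers of the corrected sentence, the clipping idea can be sketched for a single sample as below. This is a minimal illustration of the clipped surrogate objective from the linked paper, not the library's implementation; the function name, argument names, and the clip_range value of 0.2 are assumptions chosen for the example.

```python
import math


def clipped_surrogate(new_log_prob, old_log_prob, advantage, clip_range=0.2):
    """Per-sample clipped surrogate objective (a quantity to maximize).

    All names here are hypothetical and chosen for illustration only.
    """
    # Probability ratio between the new and the old policy for the taken action.
    ratio = math.exp(new_log_prob - old_log_prob)
    # Keep the ratio inside [1 - clip_range, 1 + clip_range].
    clipped_ratio = max(1.0 - clip_range, min(ratio, 1.0 + clip_range))
    # Taking the minimum removes any incentive to move the ratio
    # outside the clipping interval in the direction that raises the objective.
    return min(ratio * advantage, clipped_ratio * advantage)
```

With a positive advantage, a ratio above 1 + clip_range earns no extra objective, so the update has no reason to push the new policy further from the old one.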