Commit e0ad771

Update README.md
1 parent 0a3a121 commit e0ad771

1 file changed: +7 -0 lines changed

README.md

Lines changed: 7 additions & 0 deletions
@@ -118,6 +118,13 @@ Since Tensorflow 2.0 has already incorporated the dynamic graph construction ins
 `qmix.py`: a fully cooperative multi-agent RL algorithm, demo environment using [pettingzoo](https://www.pettingzoo.ml/atari/entombed_cooperative).
 
 paper: http://proceedings.mlr.press/v80/rashid18a.html
+
+* **Phasic Policy Gradient (PPG)**:
+
+todo
+
+paper: [Phasic Policy Gradient](http://proceedings.mlr.press/v139/cobbe21a.html)
+
 
 * **Maximum a Posteriori Policy Optimisation (MPO)**:
 