We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent 0a3a121 commit e0ad771Copy full SHA for e0ad771
README.md
@@ -118,6 +118,13 @@ Since Tensorflow 2.0 has already incorporated the dynamic graph construction ins
118
`qmix.py`: a fully cooperative multi-agent RL algorithm, demo environment using [pettingzoo](https://www.pettingzoo.ml/atari/entombed_cooperative).
119
120
paper: http://proceedings.mlr.press/v80/rashid18a.html
121
+
122
+ * **Phasic Policy Gradient (PPG)**:
123
124
+ todo
125
126
+ paper: [Phasic Policy Gradient](http://proceedings.mlr.press/v139/cobbe21a.html)
127
128
129
* **Maximum a Posteriori Policy Optimisation (MPO)**:
130
0 commit comments