We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
2 parents 6174295 + 7b0d8ca commit 0a3a121Copy full SHA for 0a3a121
README.md
@@ -113,6 +113,11 @@ Since Tensorflow 2.0 has already incorporated the dynamic graph construction ins
113
paper: [Probabilistic Mixture-of-Experts for Efficient Deep Reinforcement Learning
114
](https://arxiv.org/pdf/2104.09122)
115
116
+ * **QMIX**:
117
+
118
+ `qmix.py`: a fully cooperative multi-agent RL algorithm, demo environment using [pettingzoo](https://www.pettingzoo.ml/atari/entombed_cooperative).
119
120
+ paper: http://proceedings.mlr.press/v80/rashid18a.html
121
122
* **Maximum a Posteriori Policy Optimisation (MPO)**:
123
0 commit comments