1 parent e5d72ab commit 9f77e26
README.md
@@ -113,6 +113,11 @@ Since Tensorflow 2.0 has already incorporated the dynamic graph construction ins
paper: [Probabilistic Mixture-of-Experts for Efficient Deep Reinforcement Learning
](https://arxiv.org/pdf/2104.09122)

+ * **QMIX**:
+
+ `qmix.py`: a multi-agent RL algorithm, demo environment using [pettingzoo](https://www.pettingzoo.ml/atari/entombed_cooperative).

+ paper: http://proceedings.mlr.press/v80/rashid18a.html

* **Maximum a Posteriori Policy Optimisation (MPO)**: