Commit 0a3a121

2 parents 6174295 + 7b0d8ca commit 0a3a121

1 file changed: +5 -0 lines changed

README.md

Lines changed: 5 additions & 0 deletions
@@ -113,6 +113,11 @@ Since Tensorflow 2.0 has already incorporated the dynamic graph construction ins
 paper: [Probabilistic Mixture-of-Experts for Efficient Deep Reinforcement Learning
 ](https://arxiv.org/pdf/2104.09122)

+* **QMIX**:
+
+`qmix.py`: a fully cooperative multi-agent RL algorithm, demo environment using [pettingzoo](https://www.pettingzoo.ml/atari/entombed_cooperative).
+
+paper: http://proceedings.mlr.press/v80/rashid18a.html

 * **Maximum a Posteriori Policy Optimisation (MPO)**:
