Skip to content

Commit 9f77e26

Browse files
Update README.md
1 parent e5d72ab commit 9f77e26

File tree

1 file changed

+5
-0
lines changed

1 file changed

+5
-0
lines changed

README.md

Lines changed: 5 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -113,6 +113,11 @@ Since Tensorflow 2.0 has already incorporated the dynamic graph construction ins
113113
paper: [Probabilistic Mixture-of-Experts for Efficient Deep Reinforcement Learning
114114
](https://arxiv.org/pdf/2104.09122)
115115

116+
* **QMIX**:
117+
118+
`qmix.py`: a multi-agent RL algorithm, demo environment using [pettingzoo](https://www.pettingzoo.ml/atari/entombed_cooperative).
119+
120+
paper: http://proceedings.mlr.press/v80/rashid18a.html
116121

117122
* **Maximum a Posteriori Policy Optimisation (MPO)**:
118123

0 commit comments

Comments
 (0)