Commit 65c7509

Update README.md
1 parent 56f0160 commit 65c7509

1 file changed: +4 -2 lines changed

README.md

Lines changed: 4 additions & 2 deletions
@@ -124,8 +124,10 @@ Since Tensorflow 2.0 has already incorporated the dynamic graph construction ins
 ](https://arxiv.org/abs/2011.07553)
 
 * **Probabilistic Mixture-of-Experts (PMOE)** :
-
-`pmoe.py`: uses a differentiable multi-modal Gaussian distribution to replace the standard unimodal Gaussian distribution for policy representation.
+PMOE uses a differentiable multi-modal Gaussian distribution to replace the standard unimodal Gaussian distribution for policy representation.
+`pmoe_sac.py`: based on off-policy SAC.
+
+`pmoe_ppo.py`: based on on-policy PPO.
 
 paper: [Probabilistic Mixture-of-Experts for Efficient Deep Reinforcement Learning
 ](https://arxiv.org/pdf/2104.09122)
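
The README text in this change describes PMOE as replacing a unimodal Gaussian policy with a multi-modal Gaussian one. As a rough illustration of that contrast only, the sketch below evaluates and samples a 1-D Gaussian mixture using the Python standard library. It is not taken from `pmoe_sac.py` or `pmoe_ppo.py`: the function names, the fixed mixture parameters, and the 1-D action are all assumptions made for the example, and it does not cover how PMOE keeps the mixture differentiable during training.

```python
import math
import random


def gaussian_log_prob(x, mean, std):
    """Log-density of a 1-D Gaussian N(mean, std^2) at x (the unimodal case)."""
    z = (x - mean) / std
    return -0.5 * z * z - math.log(std) - 0.5 * math.log(2.0 * math.pi)


def mixture_log_prob(x, weights, means, stds):
    """Log-density of a Gaussian mixture at x, using log-sum-exp for stability.

    Hypothetical helper: weights are assumed positive and summing to 1.
    """
    terms = [
        math.log(w) + gaussian_log_prob(x, m, s)
        for w, m, s in zip(weights, means, stds)
    ]
    top = max(terms)
    return top + math.log(sum(math.exp(t - top) for t in terms))


def sample_mixture(weights, means, stds, rng):
    """Pick a component by its weight, then draw an action from that Gaussian."""
    k = rng.choices(range(len(weights)), weights=weights, k=1)[0]
    return rng.gauss(means[k], stds[k]), k


if __name__ == "__main__":
    # Illustrative parameters only: two equally weighted modes at -1 and +1.
    weights, means, stds = [0.5, 0.5], [-1.0, 1.0], [0.2, 0.2]
    rng = random.Random(0)
    action, component = sample_mixture(weights, means, stds, rng)
    print("sampled action:", action, "from component:", component)
    print("log-prob under mixture:", mixture_log_prob(action, weights, means, stds))
```

With two well-separated modes, the mixture assigns high density near both -1 and +1 and very little near 0, which a single Gaussian cannot do without widening its variance to cover the gap.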
