We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent 65c7509 commit adfe505Copy full SHA for adfe505
README.md
@@ -124,7 +124,9 @@ Since Tensorflow 2.0 has already incorporated the dynamic graph construction ins
124
](https://arxiv.org/abs/2011.07553)
125
126
* **Probabilistic Mixture-of-Experts (PMOE)** :
127
+
128
PMOE uses a differentiable multi-modal Gaussian distribution to replace the standard unimodal Gaussian distribution for policy representation.
129
130
`pmoe_sac.py`: based on off-policy SAC.
131
132
`pmoe_ppo.py`: based on on-policy PPO.
0 commit comments