File tree Expand file tree Collapse file tree 1 file changed +4
-2
lines changed Expand file tree Collapse file tree 1 file changed +4
-2
lines changed Original file line number Diff line number Diff line change @@ -124,8 +124,10 @@ Since Tensorflow 2.0 has already incorporated the dynamic graph construction ins
124
124
] ( https://arxiv.org/abs/2011.07553 )
125
125
126
126
* ** Probabilistic Mixture-of-Experts (PMOE)** :
127
-
128
- ` pmoe.py ` : uses a differentiable multi-modal Gaussian distribution to replace the standard unimodal Gaussian distribution for policy representation.
127
+ PMOE uses a differentiable multi-modal Gaussian distribution to replace the standard unimodal Gaussian distribution for policy representation.
128
+ ` pmoe_sac.py ` : based on off-policy SAC.
129
+
130
+ ` pmoe_ppo.py ` : based on on-policy PPO.
129
131
130
132
paper: [ Probabilistic Mixture-of-Experts for Efficient Deep Reinforcement Learning
131
133
] ( https://arxiv.org/pdf/2104.09122 )
You can’t perform that action at this time.
0 commit comments