Skip to content

Commit b0ac32e

Browse files
author
Ervin T
authored
Cherry-pick separate critic only for PPO (#4661) (#4666)
1 parent 4e5b5b7 commit b0ac32e

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

ml-agents/mlagents/trainers/ppo/trainer.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -254,7 +254,7 @@ def create_torch_policy(
254254
behavior_spec,
255255
self.trainer_settings,
256256
condition_sigma_on_obs=False, # Faster training for PPO
257-
separate_critic=behavior_spec.action_spec.is_continuous(),
257+
separate_critic=True, # Match network architecture with TF
258258
)
259259
return policy
260260

0 commit comments

Comments
 (0)