-
Notifications
You must be signed in to change notification settings - Fork 5
Open
Description
Hi @homangab,
Thanks for the effort to boost the performance of CEM optimization in the right (gradient) direction.
However reading the code I don't see (probably should be somwhere here) an update of rewards after the actions are updated by optimizer.
Line 44 in 02a8b36
| returns = self.env.rollout(actions) |
Regards,
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels