recalculation of reward after action update

Hi @homangab, 
Thanks for the effort to boost the performance of CEM optimization in the right (gradient) direction.
However, reading the code, I don't see the rewards being updated after the optimizer updates the actions (I would expect that to happen somewhere around [here](https://github.com/homangab/gradcem/blob/02a8b36269704ab7e4c1207b6420cc788286fd67/mpc/gradcem.py#L62)). I think https://github.com/homangab/gradcem/blob/02a8b36269704ab7e4c1207b6420cc788286fd67/mpc/gradcem.py#L44 should be calculated once again, on the updated actions. Am I wrong?
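To illustrate the concern, here is a toy sketch in plain Python. It is not the repo's code: `reward`, `reward_grad`, `grad_step` and `top_k` are hypothetical stand-ins I made up, and the 1-D reward is only an assumption chosen to make the effect visible. It shows that rewards computed before a gradient step no longer describe the updated actions, so the elite selection can pick a different candidate unless the rewards are recomputed.

```python
def reward(a):
    # Hypothetical toy reward with its maximum at a = 3 (not the repo's model).
    return -abs(a - 3.0)


def reward_grad(a):
    # (Sub)gradient of the toy reward with respect to the action.
    if a < 3.0:
        return 1.0
    if a > 3.0:
        return -1.0
    return 0.0


def grad_step(actions, lr=1.0):
    # One gradient-ascent step on every candidate action.
    return [a + lr * reward_grad(a) for a in actions]


def top_k(actions, rewards, k):
    # Keep the k actions with the highest rewards (the CEM "elites").
    order = sorted(range(len(actions)), key=lambda i: rewards[i], reverse=True)
    return [actions[i] for i in order[:k]]


actions = [0.0, 2.0, 3.5, 7.0]
stale_rewards = [reward(a) for a in actions]   # computed before the update

updated = grad_step(actions)                   # actions changed by the optimizer
fresh_rewards = [reward(a) for a in updated]   # recomputed after the update

stale_elite = top_k(updated, stale_rewards, k=1)
fresh_elite = top_k(updated, fresh_rewards, k=1)

print("stale rewards:", stale_rewards, "-> elite", stale_elite)
print("fresh rewards:", fresh_rewards, "-> elite", fresh_elite)
```

In this toy case, ranking the updated actions by the old rewards selects a different elite than ranking them by the recomputed rewards, which is why I would expect a second reward evaluation after the optimizer step.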

Regards, 
