Evaluate reward terms on historic state-action data #2206
-
For a manager-based environment, I’d like to evaluate the reward terms on a pre-collected dataset of observations and actions. Is there a good way to do this without explicitly storing all reward terms, or is the only way to just store all reward terms in the pre-collected dataset? Thanks in advance for any advice. |
Beta Was this translation helpful? Give feedback.
Answered by
RandomOakForest
Apr 4, 2025
Replies: 1 comment 1 reply
-
Thank you for posting this. Storing all reward terms explicitly is the only way to do so, currently. |
Beta Was this translation helpful? Give feedback.
1 reply
Answer selected by
andrschl
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Thank you for posting this. Storing all reward terms explicitly is the only way to do so, currently.