Evaluate reward terms on historic state-action data #2206

andrschl · 2025-04-01T14:04:51Z

For a manager-based environment, I’d like to evaluate the reward terms on a pre-collected dataset of observations and actions. Is there a good way to do this without explicitly storing all reward terms, or is storing them in the pre-collected dataset the only option? Thanks in advance for any advice.

Answered by RandomOakForest

RandomOakForest · 2025-04-04T16:28:26Z

Maintainer

Thank you for posting this. Currently, the only way to do this is to store all reward terms explicitly in the dataset.
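As a rough illustration of what storing the reward terms explicitly could look like, here is a minimal, framework-agnostic sketch. `RewardTermLog`, `record`, and `total_reward` are hypothetical names invented for this example, and the values are placeholders. How the per-term values are read out of a manager-based environment at each step is not covered in this thread, so that part is left as an assumption.

```python
from collections import defaultdict


class RewardTermLog:
    """Hypothetical helper: keeps per-term rewards next to observations and actions."""

    def __init__(self):
        self.observations = []
        self.actions = []
        # Maps each reward term name to its list of per-step values.
        self.reward_terms = defaultdict(list)

    def record(self, obs, action, term_values):
        """Append one transition; term_values maps term name -> reward value."""
        self.observations.append(obs)
        self.actions.append(action)
        for name, value in term_values.items():
            self.reward_terms[name].append(float(value))

    def total_reward(self, step):
        """Sum of all stored terms at the given step index."""
        return sum(values[step] for values in self.reward_terms.values())


log = RewardTermLog()
# Placeholder values; in practice these would come from the environment
# during data collection (assumption, not shown here).
log.record(obs=[0.0, 1.0], action=[0.5],
           term_values={"tracking": 1.0, "action_rate": -0.25})
log.record(obs=[0.1, 0.9], action=[0.4],
           term_values={"tracking": 0.5, "action_rate": -0.125})
```

Keeping one list per term, rather than only the summed reward, means individual terms can still be inspected or re-weighted later without re-running the simulation.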

1 reply

andrschl Apr 4, 2025
Author

Ok, perfect. Thanks for the quick response!
