behavior of on_episode_end() in the ReplayBuffer class when I am only adding a select number of time-steps per interaction. #29
WilderLavington
started this conversation in
General
Replies: 1 comment 1 reply
-
@WilderLavington The answer is perhaps "No", but I recommend that you call it. Generally speaking, we developers assume users call In detail, |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hello,
I am currently using the Replay buffer class for storing image data and wanted to know what the effects of calling on_episode_end() for separate buffers of examples. In particular, I am storing examples with reward = 1 in one buffer and reward equal to zero in another. In this environment, reward is equal to 1 only at the end of the episode, and thus I only add one "negative" transition tuple to my replay buffer per episode. In this case, do I still need to call on_episode_end()?
Beta Was this translation helpful? Give feedback.
All reactions