-
Notifications
You must be signed in to change notification settings - Fork 23
Open
Description
I am studying by referring to your CQL code.
But, I think Line 68 not be proper to Offline RL when I run the train.py of CQL-SAC.
Line 68 : buffer.add(state, action, reward, next_state, done)
Isn't this line an off-policy model by putting data that interacts with the agent and the environment into a buffer?
I thank you for your hard works.
Metadata
Metadata
Assignees
Labels
No labels