-
Notifications
You must be signed in to change notification settings - Fork 5
Open
Labels
DiscussionGenerally something we should talk / debate aboutGenerally something we should talk / debate about
Description
Discussion
Now PER is added, many of the basic DQNs are added. We move to DDPG. Primary goals:
- DDPG algorithm for continuous envs
- A way to get DDPG to work in a Discrete env like the maze for debugging.
- HER: Hindsight Experience Replay is useful for speeding up learning / improve exploration. This will possibly involve some considerations in regards to data bunches and the MDPSlices.
- Once these are done, is there a way to unify this under a single fit function?
Metadata
Metadata
Assignees
Labels
DiscussionGenerally something we should talk / debate aboutGenerally something we should talk / debate about