Skip to content

DDPG / HER #5

@josiahls

Description

@josiahls

Discussion
Now PER is added, many of the basic DQNs are added. We move to DDPG. Primary goals:

  • DDPG algorithm for continuous envs
  • A way to get DDPG to work in a Discrete env like the maze for debugging.
  • HER: Hindsight Experience Replay is useful for speeding up learning / improve exploration. This will possibly involve some considerations in regards to data bunches and the MDPSlices.
  • Once these are done, is there a way to unify this under a single fit function?

Metadata

Metadata

Assignees

No one assigned

    Labels

    DiscussionGenerally something we should talk / debate about

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions