TensorFlow implementation of DRL containing
- DQN paper: https://www.nature.com/articles/nature14236
- Double DQN paper: https://arxiv.org/abs/1509.06461
- Dueling DQN paper: https://arxiv.org/abs/1511.06581
- Noisy Net (Noisy DQN) paper: https://arxiv.org/abs/1706.10295
- DQN with Prioritized Experience Replay paper: https://arxiv.org/abs/1511.05952
- Noisy Double DQN with Prioritized Experience Replay paper: https://arxiv.org/abs/1710.02298
- Noisy Dueling Double DQN with Prioritized Experience Replay paper: https://arxiv.org/abs/1710.02298
The following openai environment were tested:
- Cart Pole https://gymnasium.farama.org/environments/classic_control/cart_pole/
- Pong https://ale.farama.org/environments/pong/
- python 3.6
- tensorflow 1.12.0
- cuda 9.0
- cuDNN 7.1.4
the results are the mean of the scores in each 200 episodes of cartpole-v1.