PAAC.pytorch

PyTorch implementation of the PAAC (Parallel Advantage Actor-Critic) algorithm presented in "Efficient Parallel Methods for Deep Reinforcement Learning".
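To make the "advantage" part of the name concrete, here is a minimal sketch of how returns and advantages could be computed for a batch of parallel environments. It is not taken from this repository's code: the function names (`n_step_returns`, `advantages`, `batch_advantages`), the use of bootstrapped n-step returns, and the default `gamma` are assumptions chosen for illustration, and it uses plain Python lists rather than torch tensors.

```python
from typing import List, Sequence


def n_step_returns(rewards: Sequence[float], dones: Sequence[bool],
                   bootstrap_value: float, gamma: float = 0.99) -> List[float]:
    """Discounted returns for one environment's rollout, computed backwards.

    bootstrap_value is the critic's estimate for the state that follows
    the last step of the rollout (an assumption of this sketch).
    """
    returns = [0.0] * len(rewards)
    running = bootstrap_value
    for t in reversed(range(len(rewards))):
        # A terminal step cuts off any contribution from later states.
        mask = 0.0 if dones[t] else 1.0
        running = rewards[t] + gamma * running * mask
        returns[t] = running
    return returns


def advantages(returns: Sequence[float], values: Sequence[float]) -> List[float]:
    """Advantage = return minus the critic's value estimate, per step."""
    return [r - v for r, v in zip(returns, values)]


def batch_advantages(rewards, dones, values, bootstrap_values, gamma=0.99):
    """Apply the two helpers to each parallel environment's rollout."""
    result = []
    for env_r, env_d, env_v, boot in zip(rewards, dones, values, bootstrap_values):
        rets = n_step_returns(env_r, env_d, boot, gamma)
        result.append(advantages(rets, env_v))
    return result
```

In an actor-critic update, the advantages would typically weight the policy's log-probabilities while the returns serve as targets for the value head. Running every environment for the same number of steps is what lets the rollouts be stacked into one batch.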

The PAAC network does not currently use an LSTM, so the evaluation results are not very good. I'm working on an LSTM version of PAAC (waiting for a new graphics card, because the current GPU does not have enough memory).

The original paper is here: https://arxiv.org/abs/1705.04862

Requirements

PAAC.pytorch requires torch, torchvision, PIL (Pillow), and gym.

Libraries used in this project:

torch==0.1.12+32e6665
torchvision==0.1.8
Pillow==4.1.1
gym@797a25d1b1a8823b305fdb575c4378a5c288b432
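One way to compare a local environment against the versions listed above is sketched below. The helper `check_pins` and the `PINNED` table are hypothetical names introduced for this example, not part of the repository. Because gym is pinned to a git commit rather than a release number, the sketch only checks that it is installed. It relies on the standard-library `importlib.metadata` module (Python 3.8+), and assumes the installed torch build reports its version string in the same form as the pin.

```python
from importlib import metadata

# Pins copied from the list above. None means "only check that it is
# installed", used for gym because it is pinned to a commit hash.
PINNED = {
    "torch": "0.1.12+32e6665",
    "torchvision": "0.1.8",
    "Pillow": "4.1.1",
    "gym": None,
}


def check_pins(pinned, get_version=metadata.version):
    """Return a {package: status} report for the given pins.

    get_version is injectable so the function can be exercised without
    the packages actually being installed.
    """
    report = {}
    for name, wanted in pinned.items():
        try:
            installed = get_version(name)
        except metadata.PackageNotFoundError:
            report[name] = "missing"
            continue
        if wanted is None or installed == wanted:
            report[name] = "ok"
        else:
            report[name] = f"installed {installed}, pinned {wanted}"
    return report


if __name__ == "__main__":
    for name, status in check_pins(PINNED).items():
        print(f"{name}: {status}")
```

A mismatch does not necessarily mean the code will fail, but these are early torch releases, so newer versions may behave differently.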

Result (BreakoutDeterministic-v4 training log)

https://www.youtube.com/watch?v=6FMzNaL88wQ

