Deep Deterministic Policy Gradient

A clean implementation of the Deep Deterministic Policy Gradient Algorithm using PyTorch. Referenced from https://github.com/philtabor/Youtube-Code-Repository/tree/master/ReinforcementLearning/PolicyGradient/DDPG/pytorch/lunar-lander
Fixed inconsistencies with the original paper. Refactored the code and added epsilon-greedy exploration strategy, which improved training performance

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
ddpg		ddpg
.gitattributes		.gitattributes
LICENSE		LICENSE
README.md		README.md
main.py		main.py
main_demo.py		main_demo.py
requirements.txt		requirements.txt
training_result.png		training_result.png

Provide feedback