Project 2: Continuous Control Reacher - Submission

Introduction

This project is Ben Hosken's submission for Project 2 of the Deep RL Nanodegree. The environment was solved in 140 episodes.

This project uses the Reacher environment.

In this environment, a double-jointed arm can move to target locations. A reward of +0.1 is provided for each step that the agent's hand is in the goal location. Thus, the goal of your agent is to maintain its position at the target location for as many time steps as possible.

The observation space consists of 33 variables corresponding to position, rotation, velocity, and angular velocities of the arm. Each action is a vector with four numbers, corresponding to torque applicable to two joints. Every entry in the action vector should be a number between -1 and 1.
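The action bounds described above can be enforced with a simple clip before the action is sent to the environment. The following is a minimal sketch, not the code used in this repository: the helper name `clip_action` and the random test input are assumptions made for illustration, and only the action size of four and the range of -1 to 1 come from the description above.

```python
import numpy as np

ACTION_SIZE = 4  # from the description above: four numbers per action


def clip_action(raw_action):
    """Clip every entry of an action vector into the range [-1, 1].

    `clip_action` is a hypothetical helper, not part of this repository.
    """
    action = np.asarray(raw_action, dtype=np.float64)
    if action.shape[-1] != ACTION_SIZE:
        raise ValueError(f"expected {ACTION_SIZE} action values, got {action.shape[-1]}")
    return np.clip(action, -1.0, 1.0)


# Illustrative input only: a noisy vector that may fall outside the bounds.
rng = np.random.default_rng(0)
noisy_action = rng.normal(loc=0.0, scale=2.0, size=ACTION_SIZE)
clipped = clip_action(noisy_action)
```

Clipping after exploration noise is added is a common pattern with continuous-control agents, since the noise can push an otherwise valid action outside the allowed range.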

The task is episodic, and in order to solve the environment, the agent must get an average score of +30 over 100 consecutive episodes.
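One way to check the solve criterion is to keep a sliding window of the most recent 100 episode scores and compare its mean against +30. The sketch below assumes a plain list of per-episode scores; the function name `first_solved_episode` and the sample scores are hypothetical and are not taken from the notebook.

```python
from collections import deque

TARGET_SCORE = 30.0  # average score required to solve the environment
WINDOW = 100         # number of consecutive episodes averaged


def first_solved_episode(scores, target=TARGET_SCORE, window=WINDOW):
    """Return the 1-based episode at which the mean of the last `window`
    scores first reaches `target`, or None if that never happens."""
    recent = deque(maxlen=window)  # old scores drop out automatically
    for episode, score in enumerate(scores, start=1):
        recent.append(score)
        if len(recent) == window and sum(recent) / window >= target:
            return episode
    return None


# Made-up scores for illustration: 50 poor episodes, then 150 good ones.
example_scores = [0.0] * 50 + [35.0] * 150
solved_at = first_solved_episode(example_scores)
```

Because each step in the goal location earns +0.1, an episode score of 30 corresponds to roughly 300 steps spent at the target.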

Getting Started

Download the environment from one of the links below. You need only select the environment that matches your operating system:
- Single Agent
  - Linux: click here
  - Mac OSX: click here
  - Windows (32-bit): click here
  - Windows (64-bit): click here
Place the file in this folder, and unzip (or decompress) the file.

Instructions

Follow the instructions in DDPG_Reacher.ipynb to get started with training your own agent.

The agent has already been trained, and the checkpointed weights are saved in checkpoint_actor.pth and checkpoint_critic.pth.

If you only wish to run the agent with the pre-trained weights, run steps 1 and 2 but not step 3. Then run step 4 to see the trained agent in action.

If you wish to train the agent, run steps 1, 2 and then 3.

