Trajectory planning based on Reinforcement Learning with Hindsight Experience Replay, Prioritized Experience Replay & Dense Reward Engineering to solve openai-gym robotics "FetchReach-v1" environment using PyTorch & Tensorflow2.
-
Dense Reward Engineering: Engineered vector based distance measure to replace sparse rewards.
-
Hindsight Experience Relay (HER): Implemented HER Future Strategy based goal sampling for buffer augmentation.
-
Prioritized Experience Relay (PER): Samples and optimizes the past experiences ended with errors to get better future rewards.
-
Dense Reward Engineering
DDPG Agent 
PER + DDPG Agent 
-
Hindsight Experience Repay
DDPG Agent 
PER + DDPG Agent 
Install dependencies using:
pip3 install -r requirements.txt - Additionally install 'mujoco_py' according to 'https://github.com/openai/mujoco-py'
- Name: Kanishk Navale
- Email: [email protected]
- Website: https://kanishknavale.github.io/
/Reward Engineering/data/test.gif)
/HER/data/test.gif)