This Jupyter Notebook was prepared by the following group members:
- Lang Kah Chun
- Yee Qing Wei
- Sun Qi Yang
The main objective of this reinforcement learning assignment is to use the OpenAI WindyGridWorld Environment to train a smart agent to maximize the reward.
We included three different modified WindyGridWorld environments and used several approaches to solve the assignment. Our methodologies and results are documented and analyzed within this notebook.