These notebooks were part of a larger workshop introducing policy gradient methods, initially held in Oxford in March 2023 in the Human Information Processing Laboratory. They are intended to take around 2h to complete for an initial run-through.
The "HIP_DeepRLDay_2023.ipynb" notebook contains exercises for participants to complete. They are recommended to create their own copies on Google Colab.
The "HIP_DeepRLDay_2023_solutions.ipynb" is the same notebook with answers to be filled in, to be gone through together once everyone has had a chance to work on their own.