Implementation of the Options framework using the Q-learning algorithm.
Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning [Paper]
- NumPy
- Matplotlib
- Gym
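
As an illustration of how Q-learning extends to options, here is a minimal sketch of the SMDP Q-learning update described in the paper. It is not taken from this repository: the function names `smdp_q_update` and `discounted_return`, the tabular `q[state, option]` layout, and the default `alpha` and `gamma` values are all assumptions made for the example.

```python
import numpy as np


def discounted_return(rewards, gamma):
    """Discounted sum of the rewards collected while one option was running."""
    return sum((gamma ** i) * r for i, r in enumerate(rewards))


def smdp_q_update(q, state, option, rewards, next_state, done,
                  alpha=0.1, gamma=0.9):
    """Apply one SMDP Q-learning update in place and return the table.

    q          : array of shape (n_states, n_options), hypothetical layout
    state      : state in which the option was initiated
    option     : index of the option that was executed
    rewards    : primitive rewards observed while the option ran (length k)
    next_state : state in which the option terminated
    done       : True if the episode ended during the option
    """
    k = len(rewards)  # number of primitive steps the option lasted
    # No bootstrapping from a terminal state.
    bootstrap = 0.0 if done else np.max(q[next_state])
    # The bootstrap term is discounted by gamma**k, not gamma, because
    # k primitive time steps elapsed between the two decision points.
    target = discounted_return(rewards, gamma) + (gamma ** k) * bootstrap
    q[state, option] += alpha * (target - q[state, option])
    return q


if __name__ == "__main__":
    q_table = np.zeros((3, 2))
    # An option started in state 0, ran for three steps, and ended in state 2.
    smdp_q_update(q_table, state=0, option=1, rewards=[0.0, 0.0, 1.0],
                  next_state=2, done=False)
    print(q_table)
```

When every option lasts exactly one step (k = 1), this reduces to the ordinary one-step Q-learning update, which is why primitive actions can be treated as a special case of options.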