This project implements a hybridised temporal difference learning ensemble using Deep Q-Network (DQN) agents. The ensemble approach combines model-free and model-based DQN agents to improve learning efficiency and performance in reinforcement learning tasks.
- Python 3.11+
- Poetry for dependency management
-
Clone the repository:
git clone https://github.com/H1drogen/Hybridised-TemporalDifferenceWeighted-Ensemble.git cd Hybridised-TemporalDifferenceWeighted-Ensemble -
Install dependencies with Poetry:
poetry install
Edit the Environment settings and Hyperparameters in train_model.py
Edit Ensemble Hyperparameters in tdw/tdw_ensemble.py
Edit Agent Hyperparameters in DQN_Agent.py and DQN_Guided_Exploration.py
-
To train the agent, run:
poetry run python train_model.py
-
To run Evaluation, run:
poetry run python evaluate_model.py
calling any evaluation metrics with the right path to datasets.