Model-based reinforcement learning (MBRL) methods, particularly those based on model predictive control (MPC), use a learned environment model to plan actions before execution. MPC selects actions by optimizing a scoring function. Incorporating global information into the scoring function accelerates learning, but it also introduces variance. This project addresses that trade-off by proposing the sum of advantage functions (Sum-Advantage) as the scoring function, in contrast to the previously used sum of state-action values (Sum-Value). Our experiments on one Gym environment show that...
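To make the contrast concrete, here is a minimal, hypothetical sketch of the two scoring functions an MPC planner might use to rank candidate action sequences. The function names, the per-step `Q` and `V` estimates, and the trajectory format are illustrative assumptions, not the project's actual API in `main.py`.

```python
import numpy as np

def sum_value_score(q_values):
    """Sum-Value: score a trajectory by the sum of state-action values Q(s_t, a_t)."""
    return float(np.sum(q_values))

def sum_advantage_score(q_values, v_values):
    """Sum-Advantage: score by the sum of advantages A(s_t, a_t) = Q(s_t, a_t) - V(s_t).

    Subtracting the state-value baseline V(s_t) at each step reduces the
    variance of the score while preserving the ranking signal.
    """
    return float(np.sum(np.asarray(q_values) - np.asarray(v_values)))

# Toy example: two candidate trajectories with Q and V estimates per step.
q_a, v_a = [1.0, 2.0, 3.0], [0.5, 1.5, 2.5]
q_b, v_b = [2.0, 2.0, 2.0], [2.0, 2.0, 2.0]

# Sum-Value ties the candidates (6.0 vs 6.0); Sum-Advantage separates them.
best = max([("a", q_a, v_a), ("b", q_b, v_b)],
           key=lambda t: sum_advantage_score(t[1], t[2]))
print(best[0])  # "a": its summed advantage is 1.5 vs 0.0 for "b"
```

In an MPC loop, the planner would execute the first action of the best-scoring candidate sequence and replan at the next step.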
Follow these steps to set up the project on your local machine. You can install MuJoCo by following this guide.
Clone the repository and navigate to the code directory:

```shell
git clone https://github.com/re0078/advantage_sum_mbrl.git
cd advantage_sum_mbrl
```

Create a virtual environment:
```shell
virtualenv --python=<path-to-python-3.8.18> venv
source venv/bin/activate

# or, using conda
conda create --name venv python=3.8.18
conda activate venv
```

Install the requirements:
```shell
pip install -r requirements.txt --force-reinstall
```

Run the main experiment script:

```shell
python main.py --env_id Hopper-v3 --instance_number [inst_num] --scoring_method [advantage|value]
```

You can view the logs in `logs/<env_id>/<instance_number>/<scoring_method>/logs.txt`.
The rewards and the saved models are stored in `checkpoints/<env_id>/<instance_number>/<scoring_method>`.
You can modify hyperparameters in main.py for further exploration.
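When comparing the advantage and value runs, a moving average makes the noisy per-episode rewards easier to read. The sketch below is a generic helper with synthetic data; the actual on-disk format of the saved rewards in `checkpoints/` is not specified here, so loading them is left to the reader.

```python
import numpy as np

def smooth(rewards, window=10):
    """Moving average over episode rewards, for plotting noisy learning curves."""
    rewards = np.asarray(rewards, dtype=float)
    if len(rewards) < window:
        return rewards
    kernel = np.ones(window) / window
    return np.convolve(rewards, kernel, mode="valid")

# Toy usage with synthetic episode rewards (an improving but noisy run).
rng = np.random.default_rng(0)
noisy = np.linspace(0.0, 100.0, 200) + rng.normal(0.0, 5.0, 200)
curve = smooth(noisy, window=20)
print(len(curve))  # 181 points: 200 - 20 + 1
```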
- If you face any issues while cythonizing the mujoco_py module, you can use this command:

```shell
python3.8 -m pip install "cython<3"
```

- Make sure you set the `LD_LIBRARY_PATH` environment variable to point to your MuJoCo bin directory before running the main Python script:

```shell
export LD_LIBRARY_PATH=<path-to-mujoco>/.mujoco/mujoco210/bin
```

Contributors:

- Amir Noohian
- Alireza Isavand
- Reza Abdollahzadeh
We would like to express our gratitude to Prof. Machado for his advice and guidance throughout the development of this project.