This repository includes an offline RL algorithm, CORMPO, and a medical environment for RL evaluation. CORMPO addresses out-of-distribution (OOD) challenges in offline reinforcement learning by incorporating clinical domain knowledge and regularization techniques for safer policy optimization.
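The core idea behind density-regularized offline RL can be illustrated with a small sketch: fit a density model (here a Gaussian KDE) on the offline state-action data, then penalize the reward of any pair whose log-density falls below a threshold, steering the policy away from OOD regions. This is a minimal, self-contained illustration of the technique, not CORMPO's actual implementation; the function names, bandwidth, and threshold below are illustrative choices.

```python
import numpy as np

def kde_log_density(data, queries, bandwidth=0.5):
    """Log-density of a Gaussian KDE fit on `data`, evaluated at `queries`."""
    n, d = data.shape
    diffs = queries[:, None, :] - data[None, :, :]                   # (m, n, d)
    log_kernels = (-np.sum(diffs ** 2, axis=-1) / (2 * bandwidth ** 2)
                   - 0.5 * d * np.log(2 * np.pi * bandwidth ** 2))   # (m, n)
    # log-mean-exp over the n kernel centers, numerically stable
    mx = log_kernels.max(axis=1, keepdims=True)
    return (mx.squeeze(1)
            + np.log(np.exp(log_kernels - mx).sum(axis=1))
            - np.log(n))

def penalized_reward(reward, log_density, lam=1.0, tau=-5.0):
    """Subtract a penalty when a state-action pair looks out-of-distribution."""
    return reward - lam * np.maximum(0.0, tau - log_density)

rng = np.random.default_rng(0)
data = rng.normal(size=(500, 2))     # in-distribution (state, action) samples
in_dist = np.zeros((1, 2))           # query near the data
ood = np.full((1, 2), 8.0)           # query far from the data
ld_in = kde_log_density(data, in_dist)
ld_ood = kde_log_density(data, ood)
```

In-distribution pairs keep their reward untouched, while pairs far from the data support are penalized in proportion to how far their log-density falls below the threshold.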
Install all required dependencies:

```bash
pip install -r requirements.txt
```

See the README in the abiomed_env folder for environment implementation details and example scripts for using the environment.
Train CORMPO with the WS penalty on the noiseless synthetic dataset:

```bash
python cormpo/mbpo_kde/mopo.py --config cormpo/config/noiseless_synthetic/mbpo_kde_ws.yaml
```

Train CORMPO (without the WS penalty) on the noisy synthetic dataset:

```bash
python cormpo/mbpo_kde/mopo.py --config cormpo/config/noisy_synthetic/mbpo_kde.yaml
```

Evaluate a saved policy trained on the noisy synthetic dataset:
```bash
python cormpo/helpers/evaluate.py --config cormpo/config/evaluate/noisy/cormpo.yaml --policy_path "checkpoints/policy/noisy_synthetic/policy_abiomed.pth"
```

To evaluate the policy trained on the noiseless dataset, change the policy path to:

```bash
--policy_path "checkpoints/policy/noiseless_synthetic/policy_abiomed.pth"
```

- The implementation of MOPO and MBPO-KDE builds largely on this implementation of the MOPO algorithm: https://github.com/junming-yang/mopo
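The internals of evaluate.py are not shown here, but conceptually policy evaluation amounts to rolling out the saved policy in the environment and averaging episode returns. The sketch below illustrates that loop with stand-in names (ToyEnv, evaluate_policy) that are not part of this repository's API.

```python
import numpy as np

class ToyEnv:
    """Illustrative stand-in environment: 1-D state, fixed-length episodes."""
    def __init__(self, horizon=10):
        self.horizon = horizon

    def reset(self):
        self.t = 0
        self.state = 0.0
        return self.state

    def step(self, action):
        self.t += 1
        self.state += action
        reward = -abs(self.state - 1.0)   # reward peaks when state tracks 1.0
        done = self.t >= self.horizon
        return self.state, reward, done

def evaluate_policy(env, policy, episodes=5):
    """Average undiscounted return of `policy` over several rollouts."""
    returns = []
    for _ in range(episodes):
        state, done, total = env.reset(), False, 0.0
        while not done:
            state, reward, done = env.step(policy(state))
            total += reward
        returns.append(total)
    return float(np.mean(returns))

# A policy that drives the state toward the reward peak scores higher
# than one that never acts.
good_score = evaluate_policy(ToyEnv(), lambda s: np.clip(1.0 - s, -1.0, 1.0))
idle_score = evaluate_policy(ToyEnv(), lambda s: 0.0)
```

The real script additionally loads the policy weights from `--policy_path` and reads evaluation settings from the YAML config, but the rollout-and-average structure is the same.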