Skip to content

Clinically-aware OOD-regularized Model-based Policy Optimization (CORMPO), a density-regularized offline RL algorithm.

Notifications You must be signed in to change notification settings

Rose-STL-Lab/CORMPO

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

CORMPO: Clinically-aware OOD-regularized Model-based Policy Optimization

Overview

This repository includes an offline RL algorithm, CORMPO, and a medical environment for RL evaluation. CORMPO addresses out-of-distribution (OOD) challenges in offline reinforcement learning by incorporating clinical domain knowledge and regularization techniques for safer policy optimization.

Dependencies / Installation

Install all required dependencies:

pip install -r requirements.txt

Usage

MCS Digital Twin and RL Environment

See the README in the abiomed_env folder for environment implementation details and example scripts for using the environment.

CORMPO Training

Train CORMPO with WS penalty on noiseless synthetic dataset:

python cormpo/mbpo_kde/mopo.py --config cormpo/config/noiseless_synthetic/mbpo_kde_ws.yaml

on noiseless synthetic dataset:

python cormpo/mbpo_kde/mopo.py --config cormpo/config/noisy_synthetic/mbpo_kde.yaml

CORMPO Policy Evaluation

Evaluate a saved policy trained on noisy synthetic dataset:

python cormpo/helpers/evaluate.py --config cormpo/config/evaluate/noisy/cormpo.yaml --policy_path "checkpoints/policy/noisy_synthetic/policy_abiomed.pth"

To evaluate the policy trained on noiseless dataset, change policy_path to:

--policy_path "checkpoints/policy/noiseless_synthetic/policy_abiomed.pth"

Reference

About

Clinically-aware OOD-regularized Model-based Policy Optimization (CORMPO), a density-regularized offline RL algorithm.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •  

Languages