Multi-modal Masked Siamese Network Improves Chest X-ray Representation Learning

Project overview:

Abstract:

Self-supervised learning methods for medical imaging primarily rely on imaging data during pretraining. While such approaches deliver promising results, they do not leverage associated patient or scan information collected within Electronic Health Records (EHR). Here, we propose to incorporate EHR data during self-supervised pretraining with a Masked Siamese Network (MSN) to enhance the quality of chest X-ray representations. We investigate three types of EHR data, including demographic, scan metadata, and inpatient stay information. We evaluate our approach on three publicly available chest X-ray datasets, MIMIC-CXR, CheXpert, and NIH-14, using two vision transformer (ViT) backbones, specifically ViT-Tiny and ViT-Small. In assessing the quality of the representations via linear evaluation, our proposed method demonstrates significant improvement compared to vanilla MSN and state-of-the-art self-supervised learning baselines. Our work highlights the potential of EHR-enhanced self-supervised pre-training for medical imaging.

Setup

Environment setup

Install the environment as follows:

git clone https://github.com/nyuad-cai/CXR-EHR-MSN.git
cd CXR-EHR-MSN
conda env create -f environment.yml
conda activate cxr-ehr-env

Note: The environment is installed with Python 3.9.12.

Dataset setup

We use MIMIC-IV EHR and MIMIC-CXR for all experiments. We do not provide either dataset; users must acquire the data from https://mimic.physionet.org/. After downloading the data, run the EHR preprocessing Jupyter notebooks to extract the utilized features and create the datasets, as follows:

Feature extraction:

Run all cells of the EHR-data-extract.ipynb notebook located in the ./notebooks/ directory.
This notebook produces a CSV file called ehr_dataset_last.csv.
Then run all cells of the EHR-data-prep.ipynb notebook, also located in the ./notebooks/ directory.
This notebook creates the datasets for all single variables, groups, and combinations, and saves them in the same notebooks directory.
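
The two notebooks can also be executed non-interactively, in the same order as the steps above. The following is a minimal sketch, assuming `jupyter nbconvert` is available in the active environment and that it is called from the repository root. The helper names `build_nbconvert_cmd` and `run_all` are hypothetical and not part of the repository.

```python
import subprocess
from pathlib import Path

# Order follows the steps above: feature extraction first, then dataset preparation.
NOTEBOOKS = ["EHR-data-extract.ipynb", "EHR-data-prep.ipynb"]


def build_nbconvert_cmd(notebook, notebook_dir="./notebooks"):
    """Return the argv list that executes one notebook in place (hypothetical helper)."""
    path = Path(notebook_dir) / notebook
    return ["jupyter", "nbconvert", "--to", "notebook",
            "--execute", "--inplace", str(path)]


def run_all(notebook_dir="./notebooks"):
    """Execute the notebooks one after the other, stopping at the first failure."""
    for name in NOTEBOOKS:
        subprocess.run(build_nbconvert_cmd(name, notebook_dir), check=True)
```

Calling `run_all()` runs both notebooks; `check=True` makes a failed extraction step stop the run before the preparation notebook starts.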

Pretraining

# --dim             ViT hidden dimension: 192 or 384
# --ehr-in          EHR input vector dimensionality
# --ehr-out         EHR embedding dimensionality
# --data-dir        directory containing the CXR .jpeg files
# --log-dir         logging directory path
# --num-prototypes  number of trainable prototypes
# --learning-rate, --weight-decay  learning rate and weight decay values
# --max-epochs      number of training epochs
python cxr_ehr_msn_trainer.py --dim 192 \
    --ehr-in 2 \
    --ehr-out 128 \
    --data-dir path/to/mimic-cxr \
    --log-dir path/to/logs-dir \
    --num-prototypes 1024 \
    --learning-rate 0.0001 \
    --weight-decay 0.001 \
    --max-epochs 100
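
The `--dim` value selects the backbone width: 192 corresponds to ViT-Tiny and 384 to ViT-Small. The sketch below shows one way to assemble the pretraining command from Python, for example to launch runs for both backbones from a single script. The `BACKBONE_DIMS` mapping and the `build_pretrain_cmd` function are hypothetical names introduced for illustration; only the flags listed above are used, with the same default values.

```python
# Hidden dimensions of the two backbones named in the abstract (standard ViT sizes).
BACKBONE_DIMS = {"vit-tiny": 192, "vit-small": 384}


def build_pretrain_cmd(backbone, data_dir, log_dir, ehr_in=2, ehr_out=128,
                       num_prototypes=1024, learning_rate=0.0001,
                       weight_decay=0.001, max_epochs=100):
    """Return the argv list for cxr_ehr_msn_trainer.py (hypothetical helper)."""
    return [
        "python", "cxr_ehr_msn_trainer.py",
        "--dim", str(BACKBONE_DIMS[backbone]),
        "--ehr-in", str(ehr_in),
        "--ehr-out", str(ehr_out),
        "--data-dir", data_dir,
        "--log-dir", log_dir,
        "--num-prototypes", str(num_prototypes),
        "--learning-rate", str(learning_rate),
        "--weight-decay", str(weight_decay),
        "--max-epochs", str(max_epochs),
    ]


# Print the command for a ViT-Small run; pass the list to subprocess.run to launch it.
print(" ".join(build_pretrain_cmd("vit-small", "path/to/mimic-cxr", "path/to/logs-dir")))
```

Building the command as a list rather than a single string avoids shell quoting issues when paths contain spaces.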

Evaluation

# --dim            ViT hidden dimension: 192 or 384
# --freeze         freeze the backbone for linear evaluation: 1 or 0
# --dataset        evaluation dataset: mimic, chexpert, or nih
# --log-dir        logging directory path
# --scheduler      learning rate scheduler: cosine or reduce
# --learning-rate  learning rate value
# --data-percen    fraction of the data to use, for low-data regimes
# --max-epochs     number of training epochs
python evaluate.py --dim 192 \
    --freeze 1 \
    --dataset mimic \
    --log-dir path/to/logs-dir \
    --scheduler cosine \
    --learning-rate 0.0001 \
    --data-percen 1.0 \
    --max-epochs 100
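
The `--data-percen` option sets the fraction of training data used in low-data regimes. How the repository selects that subset is not shown in this README; the sketch below illustrates one common approach, a seeded random subsample of sample indices. The helper `subsample_indices` is hypothetical and is not taken from the repository.

```python
import random


def subsample_indices(num_samples, fraction, seed=0):
    """Return a sorted, reproducible random subset of range(num_samples)."""
    if not 0.0 < fraction <= 1.0:
        raise ValueError("fraction must be in (0, 1]")
    keep = max(1, round(num_samples * fraction))
    rng = random.Random(seed)  # local generator, so the global random state is untouched
    return sorted(rng.sample(range(num_samples), keep))


# Keep 10% of a hypothetical 1000-sample training set.
subset = subsample_indices(num_samples=1000, fraction=0.1)
print(len(subset))  # 100
```

Fixing the seed keeps the subset identical across runs, so results at a given data fraction remain comparable between methods.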

Citation

Please consider citing our work when using this repo:
