fixed the package name cehrbert -> cehrbert_data for hf_readmission t… (#106)
* fixed the package name cehrbert -> cehrbert_data for hf_readmission task in README
* created a section to warn users about the incompatibility between OMOP and MEDS formats
README.md: 7 additions & 1 deletion
@@ -62,6 +62,12 @@ Build the project
 pip install -e .[dev]
 ```
 
+## OMOP vs. MEDS Format Considerations
+CEHR-BERT can be trained using either the OMOP or MEDS data formats; however, models trained on one format are not compatible with those trained on the other.
+This incompatibility arises because CEHR-BERT uses different concept identifiers depending on the format: standard concept IDs (e.g., SNOMED for conditions) in OMOP,
+and source concept IDs (e.g., ICD-9/10) in MEDS. The mappings between these terminologies are many-to-many, making direct alignment between formats unreliable.
+It is therefore crucial to use a consistent data format across pretraining, fine-tuning, and downstream tasks such as linear probing.
+
 ## Instructions for Use with [MEDS](https://github.com/Medical-Event-Data-Standard/meds)
 Step 1. Convert MEDS to the [meds_reader](https://github.com/som-shahlab/meds_reader) database
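
The many-to-many point in the added README section can be illustrated with a small sketch. It assumes a toy mapping table; the codes `ICD_A`, `ICD_B`, `SNOMED_X`, and `SNOMED_Y` are hypothetical placeholders rather than real vocabulary entries, and the helper names `build_maps` and `round_trip` are illustrative only and not part of CEHR-BERT or `cehrbert_data`. The sketch maps each source code to its standard concepts and back, then flags codes that do not come back as themselves alone.

```python
from collections import defaultdict

# Hypothetical toy mapping (placeholder codes, not real ICD/SNOMED entries).
# Each pair links one source code to one standard concept.
SOURCE_TO_STANDARD_PAIRS = [
    ("ICD_A", "SNOMED_X"),
    ("ICD_A", "SNOMED_Y"),  # one source code maps to two standard concepts
    ("ICD_B", "SNOMED_X"),  # two source codes share one standard concept
]


def build_maps(pairs):
    """Build forward (source -> standard) and backward (standard -> source) lookups."""
    forward = defaultdict(set)
    backward = defaultdict(set)
    for source, standard in pairs:
        forward[source].add(standard)
        backward[standard].add(source)
    return dict(forward), dict(backward)


def round_trip(source_code, forward, backward):
    """Return every source code reachable by mapping to standard concepts and back."""
    reachable = set()
    for standard in forward.get(source_code, ()):
        reachable |= backward.get(standard, set())
    return reachable


forward, backward = build_maps(SOURCE_TO_STANDARD_PAIRS)

# A source code is ambiguous if the round trip yields anything other than itself.
ambiguous = {
    code for code in forward if round_trip(code, forward, backward) != {code}
}
print(sorted(ambiguous))
```

In this toy table both source codes turn out to be ambiguous, which is the situation the README warns about: a token vocabulary built from one side cannot be translated losslessly into the other.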