|
1 | | -# The STARDUSTT Framework -- Addressing the Curse of Knowledge in OMOP Implementation |
| 1 | +--- |
| 2 | +title: The STARDUSTT Framework -- Addressing the Curse of Knowledge in OMOP Implementation |
| 3 | +hide: |
| 4 | + - title |
| 5 | +--- |
| 6 | + |
| 7 | +## 🎥 Presentation |
| 8 | + |
| 9 | +Watch Dr. Danielle Boyce present **STARDUSTT-ALS** — a new approach to capturing, managing, and analyzing “big data” in natural history studies. |
| 10 | +[](https://youtu.be/9h89iHEiIPs?feature=shared) |
| 11 | + |
| 12 | +--- |
| 13 | + |
| 14 | +## 📄 White Paper |
| 15 | + |
| 16 | +**The STARDUSTT Framework: Addressing the Curse of Knowledge in OMOP Implementation** |
| 17 | + |
| 18 | +**Authors:** Danielle Boyce<sup>1,2,3</sup>, Pavel Goriacko<sup>4</sup>, Pamela Dasher<sup>5</sup>, William Roddy<sup>5</sup>, Wesley Anderson<sup>5</sup>, Smith F. Heavner<sup>5,6</sup> |
| 19 | + |
| 20 | +**Corresponding Author: ** [Danielle Boyce ](mailto:[email protected]) |
| 21 | + |
| 22 | +**Affiliations:** |
| 23 | +1. ALS Therapy Development Institute |
| 24 | +2. Johns Hopkins University School of Medicine |
| 25 | +3. Tufts University School of Medicine |
| 26 | +4. Montefiore Einstein, Center for Health Data Innovations |
| 27 | +5. Critical Path Institute |
| 28 | +6. Clemson University, Department of Public Health Sciences |
| 29 | +--- |
2 | 30 |
|
3 | | -## Presentation |
4 | | -## White Paper |
5 | 31 | ### Executive Summary |
| 32 | + |
| 33 | +Adopting the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM) requires collaboration across diverse stakeholders—executives, clinical informaticists, and technical teams—each with different expertise. Too often, the **curse of knowledge** impedes effective communication, slowing progress. |
| 34 | + |
| 35 | +The **STARDUSTT Framework**—**S**ecure Data, **T**echnology Awareness, **A**ctive Clinical Representation, **R**elational Databases, **D**ata Quality, **U**tilization of Git, **S**tandardized Vocabularies, **T**raining & Documentation, **T**ranslation & Communication—offers a structured approach to build readiness and overcome these barriers. |
| 36 | + |
| 37 | +--- |
| 38 | + |
6 | 39 | ### Background |
| 40 | + |
| 41 | +Large-scale OMOP CDM projects such as the National COVID Cohort Collaborative (N3C), All of Us, and EHDEN have created unprecedented opportunities for observational research. Yet, many sites face challenges due to: |
| 42 | +- Limited technical readiness |
| 43 | +- Gaps in shared understanding |
| 44 | +- Lack of role-specific training |
| 45 | + |
| 46 | +One important contributor to these challenges is the **curse of knowledge**—a cognitive bias first described by Camerer, Loewenstein, and Weber (1989) in *The Curse of Knowledge in Economic Settings: An Experimental Analysis*. This bias occurs when individuals with expertise find it difficult to imagine what it is like for others not to possess that knowledge, often leading to overestimations of others’ understanding and ineffective communication. |
| 47 | + |
| 48 | +In OMOP projects, this bias can manifest when experienced informaticists, engineers, or OHDSI community members assume that all team members are familiar with prerequisites such as relational databases, GitHub workflows, or the OHDSI tool stack. Without intentional, role-specific onboarding, new participants can become overwhelmed and disengaged. |
| 49 | + |
| 50 | +Our work on the **CURE ID** project provided direct, practical experience with these challenges and informed the development of the **STARDUSTT Framework** as a structured approach to address them. |
| 51 | + |
| 52 | + |
7 | 53 | ### The STARDUSTT Framework |
| 54 | + |
| 55 | + |
| 56 | +*Figure 1: Overview of the STARDUSTT Framework* |
| 57 | + |
8 | 58 | #### STARDUSTT Components |
| 59 | +1. **Secure Data** — De-identification, privacy protection, and regulatory compliance. |
| 60 | +2. **Technology Awareness** — Familiarity with Docker, SQL, and OHDSI tools. |
| 61 | +3. **Active Clinical Representation** — Inclusion of clinicians in decision-making and workflow mapping. |
| 62 | +4. **Relational Databases** — Understanding schema, joins, and OMOP table relationships. |
| 63 | +5. **Data Quality** — Processes for completeness, accuracy, and plausibility. |
| 64 | +6. **Utilization of Git** — Version control, collaboration, and issue tracking. |
| 65 | +7. **Standardized Vocabularies** — SNOMED, LOINC, RxNorm, and use of Athena for mapping. |
| 66 | +8. **Training & Documentation** — SOPs, MOPs, and curated learning paths. |
| 67 | +9. **Translation & Communication** — Plain-language messaging, feedback loops, and artifacts for different personas. |
| 68 | + |
| 69 | +--- |
| 70 | + |
9 | 71 | ### Implementation Insights |
| 72 | + |
| 73 | +The framework emerged from interviews with both successful and hesitant OMOP adopters. Three key findings shaped its design: |
| 74 | + |
| 75 | +- **Tailor training by role**: Executives, clinicians, and engineers have different needs. |
| 76 | +- **Focus on prerequisites**: Don’t assume baseline skills in databases, GitHub, or coding. |
| 77 | +- **Use shared language**: Maintain glossaries and avoid unnecessary jargon. |
| 78 | + |
| 79 | +--- |
| 80 | + |
10 | 81 | ### Common Problems and Strategies |
| 82 | + |
| 83 | +| Problem | Impact | Strategy | |
| 84 | +|---|---|---| |
| 85 | +| Varying expertise levels | Poor decision-making | Start with foundational training | |
| 86 | +| Overwhelming resources | Loss of engagement | Curated, role-specific materials | |
| 87 | +| Technical jargon | Miscommunication | Accessible language + shared glossary | |
| 88 | +| Complex topics too soon | Frustration | Incremental learning | |
| 89 | +| Passive learning | Low retention | Active exercises and case studies | |
| 90 | +| Infrequent feedback | Persistent gaps | Regular check-ins | |
| 91 | +| One-size-fits-all | Uneven outcomes | Diverse teaching methods | |
| 92 | +| Poor documentation | Limited autonomy | Clear, versioned resources | |
| 93 | +| Isolated learning | Slow problem-solving | Mentorship and peer learning | |
| 94 | +| Theory without practice | Weak skill transfer | Real-world application | |
| 95 | +| Static programs | Outdated skills | Continuous updates | |
| 96 | + |
| 97 | +--- |
| 98 | + |
11 | 99 | ### Applying STARDUSTT to Personas |
12 | | -#### Personas in Practice |
| 100 | + |
| 101 | +| Component | C-Suite Executives | Clinical Informaticists | Data Engineers | |
| 102 | +|---|---|---|---| |
| 103 | +| **Secure Data** | Compliance and strategic value | IRB/PHI safeguards | Advanced security techniques | |
| 104 | +| **Technology Awareness** | ROI of tools | OHDSI tools in clinical workflows | Deep technical training | |
| 105 | +| **Active Clinical Representation** | Sponsor collaboration | Bridge to technical teams | Integrate clinical context | |
| 106 | +| **Relational Databases** | Decision-making role | Basic querying | Dialect-specific scripting | |
| 107 | +| **Data Quality** | Impact on trust | Use of DQD/Achilles | Automated DQ pipelines | |
| 108 | +| **Utilization of Git** | Governance | Simplified workflows | Advanced branching & CI | |
| 109 | +| **Standardized Vocabularies** | Interoperability strategy | Vocabulary use in practice | Mapping & Athena mastery | |
| 110 | +| **Training & Documentation** | Executive dashboards | Guides & glossaries | Tutorials & peer reviews | |
| 111 | +| **Translation & Communication** | Strategic framing | Clinical translation | Technical action from requirements | |
| 112 | + |
| 113 | +--- |
| 114 | + |
| 115 | +### Personas in Practice |
| 116 | + |
| 117 | +**C-Suite Executives** need strategic summaries, compliance implications, and ROI metrics. |
| 118 | +**Clinical Informaticists** benefit from practical workflow examples and clear connections to patient care. |
| 119 | +**Data Engineers** require technical depth and context to align builds with project goals. |
| 120 | + |
| 121 | +--- |
| 122 | + |
13 | 123 | ### Conclusion |
| 124 | + |
| 125 | +The STARDUSTT Framework bridges gaps in knowledge, builds shared understanding, and accelerates OMOP adoption. By addressing role-specific needs and the curse of knowledge, it improves efficiency, reduces frustration, and increases sustainability in data harmonization projects. |
| 126 | + |
| 127 | +--- |
| 128 | + |
14 | 129 | ### Acknowledgements |
| 130 | + |
| 131 | +We thank **Irina Titkova, PhD** (Show-cell.com, Barts Cancer Institute, Queen Mary University of London) for graphic design and **Grace Zaikoski** for editorial support. |
| 132 | + |
| 133 | +--- |
| 134 | + |
15 | 135 | ### References |
| 136 | + |
| 137 | +1. Heavner SF, Anderson W, Kashyap R, Dasher P, Mathé EA, Merson L, Guerin PJ, Weaver J, Robinson M, Schito M, Kumar VK, Nagy P. A Path to Real-World Evidence in Critical Care Using Open-Source Data Harmonization Tools. *Crit Care Explor.* 2023;5(4):e0893. doi:10.1097/CCE.0000000000000893. |
| 138 | +2. Camerer C, Loewenstein G, Weber M. The curse of knowledge in economic settings: An experimental analysis. *J Polit Econ.* 1989;97(5):1232–1254. |
| 139 | +3. Hripcsak G, Mirhaji P, Low AF, Malin BA. Preserving temporal relations in clinical data while maintaining privacy. *J Am Med Inform Assoc.* 2016;23(6):1040–1045. doi:10.1093/jamia/ocw001. |
| 140 | +4. Kahn MG, Callahan TJ, Barnard J, et al. A Harmonized Data Quality Assessment Terminology and Framework for the Secondary Use of Electronic Health Record Data. *EGEMS.* 2016;4(1):1244. |
| 141 | +5. Observational Health Data Sciences and Informatics (OHDSI). *The Book of OHDSI.* 2023. https://ohdsi.github.io/TheBookOfOhdsi/ |
0 commit comments