Skip to content

BoyceLab/DataBasics

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

22 Commits
 
 

Repository files navigation

DataBasics: A Resource on Research Using Electronic Health Records

Introduction

Electronic Health Records (EHR) have revolutionized the way healthcare information is stored, accessed, and used across the medical industry. This primer aims to provide a lay audience with an understanding of EHR data, its history, key definitions, and insights into the standards and methodologies for extracting EHR data for research. We'll explore HL7, CDA, and FHIR standards, the differences between direct EHR data extraction and using OAuth 2.0, and review common data models along with the pros and cons of various data extraction processes.

History of EHR Data

EHR data's history dates back to the 1960s with the earliest systems focusing on simple data capture and storage. Over the decades, EHR systems have evolved significantly, incorporating more complex data types such as imaging and laboratory results, and enabling more sophisticated data sharing and interoperability. The goal has always been to improve patient care, streamline healthcare operations, and facilitate health research. EHR data vendors include Epic and Cerner.

Definitions

  • EHR (Electronic Health Record): A digital version of a patient's paper chart. EHRs contain the patient's medical history, diagnoses, medications, treatment plans, immunization dates, allergies, radiology images, and laboratory test results. EHRs are designed to be shared with other providers across more than one healthcare organization to provide accurate, up-to-date, and complete information about patients at the point of care.

  • HL7 (Health Level Seven International): A set of international standards for the exchange, integration, sharing, and retrieval of electronic health information. HL7 standards support clinical practice and the management, delivery, and evaluation of health services.

  • CDA (Clinical Document Architecture): An HL7 standard designed to allow healthcare providers to exchange clinical documents, such as discharge summaries and progress notes, in a readable and structured format.

  • FHIR (Fast Healthcare Interoperability Resources): The latest standard by HL7, designed to enable healthcare information to be available, discoverable, and understandable globally, and to support a wide range of applications, including mobile apps, cloud communications, and data analysis algorithms. For a comprehensive understanding of Fast Healthcare Interoperability Resources (FHIR) concepts and data models, refer to the Microsoft Learn documentation on FHIR. From this training, "US Core provides the framework for the implementation of regulations that are driven by the Centers for Medicare & Medicaid Services (CMS) and the ONC Final Rule [https://www.healthit.gov/topic/oncs-cures-act-final-rule]. Payors and providers across the United States who accept Medicare/Medicaid patients implement FHIR® by following the profiles that are defined in US Core.

  • OAuth 2.0: An authorization framework that enables applications to obtain limited access to user accounts on an HTTP service, such as Facebook, GitHub, and digital health records. NIH's All of Us Study uses OAuth 2.0 via Sync for Science.

  • APIs An API, or Application Programming Interface, is a set of rules, protocols, and tools for building software and applications. It specifies how software components should interact and allows different software applications to communicate with each other. APIs play a crucial role in enabling the integration of systems, services, and applications by providing a predefined way for accessing a set of functions or data.

In web development, APIs are often used to enable the interaction between different web services. For example, a weather service might provide an API that allows developers to integrate weather data into their own websites or applications. Similarly, social media platforms offer APIs that enable developers to create apps that can post updates, access user data (with permission), or analyze trends based on the platform's data.

APIs can be categorized into different types, including:

  • Web APIs: These are designed for both the web server and web browser. They include REST (Representational State Transfer), SOAP (Simple Object Access Protocol), and GraphQL APIs, among others, and are used for web services communication.

  • Operating System APIs: These provide functions for managing and controlling computer hardware and basic system operations. They allow developers to write applications compatible with the operating system.

  • Database APIs: These allow communication between an application and a database management system. Developers can use these APIs to create, read, update, and delete database entries.

  • Remote APIs: These are used for communication between software on different machines across a network, enabling distributed computing.

Extracting EHR Data

Extracting EHR data can be done directly from the EHR system at a healthcare site or via web-based APIs using protocols like OAuth 2.0. Direct extraction involves accessing the EHR database or data warehouse, often requiring custom queries and significant IT support. OAuth 2.0, meanwhile, allows researchers to access data through predefined APIs with the user's consent, facilitating more standardized and scalable data access.

When it comes to extracting EHR data for clinical research, the devil is in the details. Do not assume that all approaches can extract 100% of the relevant information or records. Some information is either a) not reoutinely collected in EHR or b) not available through the extraction approach selected.

Pros and Cons of Different Data Extraction Processes

Direct Extraction:

  • Pros: Can access comprehensive EHR data; allows for custom queries.
  • Cons: Requires significant IT resources; potential privacy and security concerns; may be time-consuming.

OAuth 2.0/API-Based Extraction:

  • Pros: Standardized access method; scalable; less IT-intensive; enhances data security .
  • Cons: May offer limited access to EHR data; dependent on the availability and functionality of APIs.
Extraction Method Pros Cons
OAuth/API-based Standardized access; scalable; less IT-intensive; enhances security by limiting access May offer limited access to data; dependent on API functionality
Direct Extraction Access to comprehensive EHR data; allows for custom queries Requires significant IT resources; potential privacy concerns

Common Data Models

Common Data Models (CDMs) standardize the format and terminology of healthcare data from various sources, making it easier to aggregate, share, and analyze data across different systems. Examples include the Observational Medical Outcomes Partnership (OMOP) and the Patient-Centered Outcomes Research Network (PCORnet).

Data Standards, Common Data Elements, and Validated Instruments

CDISC: Clinical Data Interchange Standards Consortium. The Clinical Data Interchange Standards Consortium (CDISC) is a global non-profit organization that develops and supports global data standards to improve the quality and efficiency of clinical research. CDISC standards are designed to facilitate the harmonization of clinical data and streamline research processes from protocol through analysis and reporting. This standardization supports the direct use of EHR data for research and regulatory submissions, making it easier to compare and combine data from different studies or sources.

Common Data Elements (CDEs)

Common Data Elements (CDEs) are precisely defined questions and answer choices, along with associated metadata, used in clinical research and patient care. They are intended to promote the collection of standardized data across studies and health records. By standardizing the way data is collected, stored, and exchanged, CDEs enable researchers to more easily compare and combine data from different sources, enhancing data quality and research efficiency.

Validated Instruments

Validated instruments are tools or questionnaires that have been scientifically proven to accurately and reliably measure a specific concept or construct, such as symptoms, quality of life, or functional status. Validation involves rigorous testing to ensure the instrument consistently produces accurate data that reflects what it is designed to measure across different populationsand settings.

Integrating CDISC, CDEs, and Validated Instruments in EHR Data for Research

The integration of CDISC standards, CDEs, and validated instruments into EHR systems and research protocols can significantly enhance the quality, efficiency, and comparability of health data. This integration supports streamlined data collection and analysis, regulatory compliance and submission, and enhanced interoperability.

Resources and "MUST READ"papers:

EBMonFHIR-based tools and initiatives to support clinical research

  • Authors: Alper B. S.
  • Journal: Journal of the American Medical Informatics Association : JAMIA, 30(1), 2022.
  • DOI/URL: 10.1093/jamia/ocac193
  • PMC: PMC9748541

HL7 FHIR-based tools and initiatives to support clinical research: a scoping review

  • Authors: Duda SN, Kennedy N, Conway D, Cheng AC, Nguyen V, Zayas-Cabán T, Harris PA.
  • Journal: J Am Med Inform Assoc, 29(9), 2022.
  • DOI: 10.1093/jamia/ocac105
  • PMID: 35818340; PMCID: PMC9382376.
  • URL: PubMed

Technology-Enabled Clinical Trials: Transforming Medical Evidence Generation

  • Authors: Marquis-Gravel G, Roe MT, Turakhia MP, Boden W, Temple R, Sharma A, Hirshberg B, Slater P, Craft N, Stockbridge N, McDowell B, Waldstreicher J, Bourla A, Bansilal S, Wong JL, Meunier C, Kassahun H, Coran P, Bataille L, Patrick-Lake B, Hirsch B, Reites J, Mehta R, Muse ED, Chandross KJ, Silverstein JC, Silcox C, Overhage JM, Califf RM, Peterson ED.
  • Journal: Circulation, 140(17), 2019.
  • DOI: 10.1161/CIRCULATIONAHA.119.040798
  • PMID: 31634011

For more information on HL7, CDA, and FHIR standards, and on extracting EHR data, you can visit the following websites:

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published