Research Use Case
Description
The purpose of the Making Electronic Data More Available for Research and Public Health (MedMorph) Research Data Exchange Implementation Guide (IG) is to streamline and expedite the process of onboarding research data partners and contributing data to research networks. The current processes used to extract, transform, and load (ETL) the data and then contribute the data involve many non-standardized mechanisms (e.g., Secure File Transfer Protocol (SFTP), Excel Files, stored procedures), different structures (e.g., formats), and different semantics. As a result of these non-standardized processes, the length of time to onboard a data partner varies from weeks to months. This research data exchange use case along with leveraging the MedMorph Reference Architecture (RA) IG with other existing Health Level 7 (HL7®) Fast Healthcare Interoperability Resources (FHIR®) IGs will help reduce the length of time it takes to onboard new data exchange partners.
Problem Statement
Goals of the Use Case
The goals of the Research Data Exchange use case include:
Improve efficiency in onboarding new data exchange partners
Improve data latency
Increase the volume, quality, completeness, and timeliness of data submitted to research organization at a lower cost
Provide standardize way to collect and exchange data for research
Leverage the MedMorph RA IG
Streamline data element mapping from the data mart
Scope of the Use Case
In-Scope
Bulk data exchange
Identify the data elements to be retrieved from the Data Source for research needs
Out-of-Scope
State and local policies around data exchange for research
Assessment of the data quality of the content extracted from the Data Source.
Data captured outside the Data Source and communicated directly to data marts.
Data exchange/Data use agreements between data source and research organization
Utilization of unstructured data (e.g., images or text blobs)
Use Case Actors
Data Source: A system (e.g., EHR, HIE) used in care delivery for patients and that captures and stores data about patients and makes the information available instantly and securely to authorized users. While an EHR does contain the medical and treatment histories of patients, an EHR system is built to go beyond standard clinical data collected in a provider’s provision of care location and can be inclusive of a broader view of a patient’s care. EHRs are a vital part of health IT and can:
Contain a patient’s medical history, diagnoses, medications, treatment plans, immunization dates, allergies, radiology images, and laboratory and test results
Allow access to evidence-based tools that providers can use to make decisions about a patient’s care
Automate and streamline provider workflow
A FHIR Enabled Data Source exposes FHIR APIs for other systems to interact with the EHR and exchange data. FHIR APIs provide well defined mechanisms to read and write data. The FHIR APIs are protected by an Authorization Server which authenticates and authorizes users or systems prior to accessing the data.
Backend Services App: A system that resides within the clinical care setting and interacts with the EHRs, Data Marts and the systems used by the researchers. The system will use specific Knowledge Artifact resources to identify when data has to be extracted, when data has to be queried and how results have to be returned.
Trust Service Provider: Trust Service Provider affords capabilities that can be used to translate data between different models, map terminologies between different data models, pseudonymize, anonymize, de-identify, hash, or re-link data that is submitted to public health and/or research organizations. These capabilities are called Trust Services. Trust Services are used, when appropriate, by the Backend Service App.
Data Mart: A system that will hold data that is to be accessed by researchers. Typically, researchers do not access the EHR or operational data stores directly as they are used for clinical operations. In order to facilitate access to researchers’ healthcare organizations populate Data Marts that can be accessed by researchers. The data models (e.g., PCORnet CDM, i2b2, OMOP, Sentinel, FHIR) that are used to store the data may depend on the types of research studies.
Research Abstract Model
Figure 1. Research Use Case Abstract Model
Use Case User Stories and Diagrams
Preconditions
IRB Approval
User Stories
User Story 1 – Onboard a New Research Data Partner
As a research network administrator, there is a need to onboard research data partners to join the research network and contribute data that can be used for research. The data partners extract, transform, and load (ETL) the data from data sources such as Electronic Health Records (EHRs), Health Information Exchanges (HIEs), or other data repositories (e.g., clinical data warehouses). These processes (e.g., ETL) when standardized will expedite onboarding processes and deliver better quality data at a lower cost.
Table 1: Onboard a New Data Partner Workflow
Step | Actor | Role | Activity | Input(s) | Output(s) |
1 | Backend Services App | Requestor | Request artifacts from the KAR | Artifact identifiers | KAR request |
2 | KAR | Query Responder | Return the requested artifacts | KAR request | Response with artifacts |
3 | Backend Services App  | Data Receiver | Receive and validate artifacts | Artifacts | Validated artifacts |
4 | Backend Services App | Provisioner | Apply parameters, value sets, etc. from artifacts | Validated artifacts | Updated BSA logic |
5 | Backend Services App | Provisioner | Send value sets with trigger codes to the EHR System | Validated artifacts | FHIR bundle |
6 | EHR System | Data Receiver | Receive and validate FHIR bundle | FHIR bundle | FHIR validated bundle |
7 | EHR System | Provisioner | Instantiate triggers based on BSA value sets | FHIR validated bundle | Triggers |
User Story 1 – Onboard a New Research Data Partner Activity Diagram
Figure 2 below illustrates the flow of events and information between the actors for the Onboard a New Research Data Partner workflow.
Figure 2: Onboard a New Research Data Partner User Story Activity Diagram
User Story 1 – Onboard a New Research Data Partner Sequence Diagram
Figure 3 below represents the interactions between actors in the sequential order that they occur in the onboarding of a new research data partner workflow.
Figure 3: Onboard a New Research Data Partner User Story Sequence Diagram
User Story 2 – Accessing Additional Data for a Specific Research Question (future consideration)
Once a data partner joins a research network and contributes data, a researcher should be able to perform queries to filter specific data sets and analyze the data. Currently researchers can query data marts on a limited basis because of the variations in the data models (e.g., National Patient-Centered Clinical Research Network (PCORnet) Clinical Data Management (CDM), Informatics for Integrating Biology & the Bedside (i2b2), Observational Medical Outcomes Partnership (OMOP), FHIR Resources) and the technologies (e.g., Microsoft Structured Query Language (SQL), Postgres SQL, Statistical Analysis System (SAS)) used to host the data mart. Standardizing these data access mechanisms will help overcome the hurdles faced by researchers in accessing data from data partners and EHRs. NOTE: User Story #2 is out of scope for this version of the IG, it is being included only as a potential use case that could be expanded on in the future.
Â
Â