!2012-06-23 EDM-F Stakeholder Symposium Data Models KAHN V3.ppt

Download Report

Transcript !2012-06-23 EDM-F Stakeholder Symposium Data Models KAHN V3.ppt

Data Model Considerations for Clinical
Effectiveness Researchers
Michael G. Kahn1,3,4, Deborah Batson4, Lisa Schilling2
1Department
of Pediatrics, University of Colorado, Denver
2Department of Medicine, University of Colorado, Denver
3Colorado Clinical and Translational Sciences Institute
4Department of Clinical Informatics, Children’s Hospital Colorado
Electronic Data Methods (EDM) Forum Stakeholder Symposium
Building an Electronic Clinical Data Infrastructure to Improve Patient Outcomes
23 June 2012
[email protected]
Funding was provided by a contract from AcademyHealth. Additional support was provided by AHRQ 1R01HS019912-01 (Scalable PArtnering
Network for CER: Across Lifespan, Conditions, and Settings), AHRQ 1R01HS019908 (Scalable Architecture for Federated Translational Inquiries
Network), and NIH/NCRR Colorado CTSI Grant Number UL1 RR025780 (Colorado Clinical and Translational Sciences Institute).
Disclosures
Presentation based on EDM Forum commissioned paper:
“Data model considerations for clinical effectiveness researchers”
Medical Care 50(9) 2012 S60-S67 DOI: 10.1097/MLR.0b013e318259bff4. (publish
ahead of print).
2
What is a data model & why should I care?
• A data model determines:
– What data elements can be stored
– What relationships between data
can be represented
– Technical stuff: data type,
allowed ranges, required versus
optional (missingness)
• You should care because it
determines:
– How easy can data be recorded
– How easy can data be extracted
– Contributes to data quality
3
Visit- versus Patient-centric data models
4
Query Complexity: “For each patient, how many
medications where filled over a period of time?”
Four-table join
Three-table join
5
Query Complexity: “Average number of
prescriptions written per visit?”
Two-table join
Three-table join +
Date comparisons
6
SAFTINet Asthma Cohort Definition
1.
Adults (ages 18 and over) as of Jan 1, 2009 receiving care in
selected sites who:
– Have had at least 2 visits separated by at least 30 days coded as
493.xx in the 18 months prior to July 1, 2011, OR
– A single diagnosis of 493.xx AND two filled prescriptions for an asthma
maintenance medication separated by at least 30 days in the past 12
months.
2.
Exclusion criteria: Patients with other concomitant chronic lung
disease
–
–
–
–
–
Cystic fibrosis
COPD, emphysema, chronic bronchitis
Alpha-1-antitrypsin deficiency
Pulmonary fibrosis
Active TB
7
Additional Data Model Requirements for SAFTINet
• Create patient-level analytic data sets
• Calculate ages to the year for adults, and to smaller units
of measurement for children
• Calculate prescribed drug intervals (drug exposures)
• Link data across disparate data sources
• Use standardized terminologies to take advantage of
conceptual hierarchies and relationships
• Identify a patient as being part of a defined cohort
• Support limited data sets compliant with HIPAA
8
Key questions for a data model
• From Jeff Brown regarding FDA Sentinel Initiative*:
1. What does the system need to do?
2. What data are needed to meet
system needs?
3. Where will the data be stored?
4. Where will the data be analyzed?
5. Is a common data model needed,
and if so, what will the model look like?
*Brown JS, Lane K, Moore K, Platt R. Defining and evaluating possible database models to implement the FDA Sentinel initiative. U.S. Food and Drug Administration;
May 2009 2009.
9
Eight dimensions of data models
Modified from Moody and Shanks*
Dimension
Original definition
Recasted definition for CER
1. Completeness
Does the data model contain all
user requirements?
Can the data model store and retrieve
data to meet investigator CER needs?
2. Integrity
Does the data model conform to
the business rules and
processes to guarantee data
integrity and enforce policies?
Does the data model enforce
meaningful data relationships and
constraints that uphold the intent of
the data’s original purpose, i.e., clinical
care, billing?
3. Flexibility
Does the data model deal with
changes in business and/or
regulatory change?
Can new data elements and
relationships be added if project scope
or if regulatory rules (e.g., patient
identification) changes?
4.
Are the concepts and structures
Understandability in the data model easily
understood?
Do the concepts, structures and
relationships make sense to
investigators, data managers, and
statisticians?
*Moody DL, Shanks GG. Improving the quality of data models: Empirical validation of a quality management framework. Information Systems. 2003;28:619-650.
10
Eight dimensions of data models
Modified from Moody and Shanks*
Dimension
Original definition
Recasted definition for CER
5. Correctness
Does the data model conform to
the rules of the data modeling
technique?
Does the model conform to good data
modeling practices such as limited
data storage redundancy?
6. Simplicity
Does the data model contain the
minimum possible entities and
relationships?
Are concepts represented as
straightforwardly as possible? Are all
data element necessary?
7. Integration
Is the data model consistent with
the rest of the organization’s
data?
Do all of the various data domains,
such as demographics, observations,
labs and medications “hang together”
in a consistent and logical fashion?
8.
Implementability
Can the data model be
implemented within existing time,
budget, and technology
constraints?
Can the data model be implemented
and maintained by current and future
partners given anticipated budgets,
time, and technical constraints?
*Moody DL, Shanks GG. Improving the quality of data models: Empirical validation of a quality management framework. Information Systems. 2003;28:619-650.
11
12
Potential data models considered by SAFTINet
Name
Developing entity
Initial Purpose
Observational
Medical Outcomes
Project (OMOP)
Foundation of the NIH
Comparative Drug Outcomes Studies
Virtual Data
Warehouse (VDW)
HMO Research Network
Distributed data warehouse to allow
comparative studies across collaborating
sites: HMORN, CRN, Oregon CTRI
i2b2
Partners Healthcare
Informatics framework for clinical and
biological data integration
OpenMRS
Regenstrief Institute
Open source enterprise medical record
system platform
OpenEHR
OpenEHR Foundation
Semantically-enabled open source health
computing platform
13
Summary Findings
• None of the existing publicly-available data
models met all requirements
• License-free, flexibility, active community and
willingness to collaborate were key features for
SAFTINet
• Each project has different requirements and
priorities.
• There is no best model for all potential CER uses
14
Data Model Considerations for Clinical
Effectiveness Researchers
Questions?
Electronic Data Methods (EDM) Forum Stakeholder Symposium
Building an Electronic Clinical Data Infrastructure to Improve Patient Outcomes
23 June 2012
[email protected]
Funding was provided by a contract from AcademyHealth. Additional support was provided by AHRQ 1R01HS019912-01 (Scalable PArtnering
Network for CER: Across Lifespan, Conditions, and Settings), AHRQ 1R01HS019908 (Scalable Architecture for Federated Translational Inquiries
Network), and NIH/NCRR Colorado CTSI Grant Number UL1 RR025780 (Colorado Clinical and Translational Sciences Institute).