Secondary Database Analysis II

Transcript Secondary Database Analysis II

Secondary Database Analysis II
Case-Mix Adjustment
What is the Purpose of Health
Services Research?




Income
Ethnicity
EMR
CPOE
Utilization
A1c
Immunization
Medical Errors
Last Time – Uses
Secondary Data:

Can provide new information on health care delivery
(quality; geographic variation; post-marketing adverse
events; cost), on the natural history of disease, and on
regulatory matters

Can save data-collection resources: time, money,
personnel, participant burden

Can save researcher resources: your time, your money

May permit you to build your CV without anyone’s help
or funds – just your time (but co-authors are a good
thing)

Can provide preliminary data for a grant proposal

May enable research on rare events or difficult
populations
Last Time – Precautions


Take note of the study design; is it
suitable?
Consider: What are inclusion & exclusion
criteria? Why? How does that affect the
sample and generalizability?
This Time – Precautions




What bias may be inherent in the database –
perhaps from the population or from the method
of measurement?
Selection bias: who got included
Migration bias: who was lost/gained and why
Other sources of systematic error (bias)?
This Time – Precautions &
Solutions





Control for bias –
Restrict
Match
Stratify
Adjust with covariates
Risk
Adjustment
Validity
infer
Truth in the
Universe
Truth in the
Study
ERRORS
RESEARCH
QUESTION
infer
Nesting/Clustering of
data
Findings in
the Study
ERRORS
STUDY
PLAN
ACTUAL
STUDY
Target
Population
Intended
Sample
Actual
Subjects
Phenomenon
of Interest
Intended
Variables
Actual
Measurements
EXTERNAL
VALIDITY
INTERNAL
VALIDITY
Bias

Systematic error in measurement or a
systematic difference (other than the
one of interest) between groups

Selection



For cohorts, assembly, migration,
contamination, and referral bias
Measurement
Confounding
Bias: Anticipate and Control






Restriction (may lose generalizability)
Matching (limited # factors)
Stratification
Standardization
Multivariate Adjustment
Assuming the worst (sensitivity analyses)


If needed, conduct sensitivity analyses on
multivariable models
>> Always discuss potential impact of
uncorrected bias on your results
Bias: Systematic Error

Selection Bias





Case-Mix/Disease Severity
Nesting/clustering
Measurement Bias
Recall Bias
Investigator Bias
Example: Hospital Mortality
Report Cards



Originally unadjusted
Hospitals without trauma centers,
doing primarily elective surgery, etc.,
looked really good
Made hospitals who took care of the
sickest of the sick look bad
11/05/03
Quality Assessment

Data Quality: Garbage in, garbage out

Risk Adjustment: To remove the
confounding effect of different
institutions providing care to patients
with dissimilar severity of illness and
case complexity
Risk Adjustment
 Controls for patient characteristics that
are related to the outcomes of interest
 Removes the confounding effect, e.g., of
different institutions providing care to
patients with dissimilar severity of illness
and case complexity
 Addresses regional variations
 Inadequate case-mix adjustment can
lead to misclassification of outlier status
Essential Elements of Risk Adjustment




Outcome-specific
Contains specification of the principal
diagnosis
Contains demographics as proxies for
preexisting physiological reserve
Measures # of comorbidities and
allows all the most important
comorbidities to assume their own
empirically derived coefficients
Additional considerations





– outcome-specific: is one available or do you need to
construct & validate one yourself?
– an index incorporates many predictors – do you need
to study some separately?
– established comorbidity indices for case-mix
adjustment include Charlson, Elixhauser, Selim,
developed on various patient pops – suitable for yours?
– other predictors based on your reading of the literature
– do you need omitted constructs?
– propensity scores (for treatment choice; good for small
datasets, for non-randomized studies of treatment effect)
Risk Adjustment & Outcomes




Primary data collection vs.
administrative data
Disease-specific vs. generic
Commercial vs. developed for your
study
Predictors vary by outcomes being
predicted
Classification of Disease States



ICD-9: too many specific codes (n~10,000)
Clinical Classifications for Health Policy
Research (CCHPR): good for chronic illness
and longitudinal care
[http://www.ahrq.gov/data/hcup/his.htm]
Primary diagnosis: good for studies that
focus on a single episode of care
Famous Methods of Risk
Adjustment

DRGs: diagnosis related groups


APACHE III


Adult ICU
PRISM


Used by Medicare to set hospital
reimbursement
Pediatric ICU
Charlson Score

Adult 1 year survival after hospitalization
Risk Adjustment: Charlson

Advantages:


Commonly used case-mix classification
system in the health care industry
System with which most clinicians and
reviewers are familiar
Risk Adjustment: Charlson

Disadvantages




Principal diagnosis not differentiated
Original work did not specify ICD-9 codes
that went into the disease categories
Developed on inpatients predicting
mortality; may not be well suited to
outpatients at low risk of death
Not good for longitudinal care / chronic
illness
Demographic Factors in Risk Adjusters


Age (e.g., age-adjusted Charlson)
Proxies of Social Support





Marital status
Race
Gender
SES (occupation, employment status,
education)
Proxies of Socioeconomic Status


Health insurance status
Home address zip code average income
Race and Gender


Don’t adjust for automatically
Ideally adjust for variation in the
patients’ physiological reserve and
disease burden but not for variation in
care rendered to patients
Propensity Scores






Useful when sample size is small, to conserve
power
An alternative to including a lot of covariates
Ask: propensity for what?
Include as many predictors as possible to get
predicted probability of group membership
(Rosenbaum & Rubin)
Published schema may include predictors you want
to study separately
Best for non-randomized studies of treatment effect
where you want to adjust for the factors that may
have influenced the treatment choices
Study Design: Minimize Bias
Decision #1:
Experimental Study
Apply intervention,
observe effect on
outcome
yes
Alter events
under study?
no
Observational Study
no
Decision #2:
Make measurement
on more than one
occasion?
yes
Cross-Sectional Study
Each subject examined
on only one occasion
Longitudinal Study
Each subject followed
over a period of time
Case-control
Cohort
Risk adjustment is needed…


When subjects are not randomly
assigned
People do not randomly distribute by



Setting
Provider
Risk Adjust

Outcomes = f (intrinsic pt factors;
treatment applied; quality of treatment;
random chance)
On What Factors Should We Risk
Adjust?

Risk of what outcome?




For what population?




Ejection Fraction
Readmission
Activities of Daily Living (feeding, dressing, etc.)
Inpatient
Outpatient
Nursing Home
For what purpose?



Clinical
Quality
Payment
Classes of potential risk
factors







Demographics: age, sex, etc.
Physiologic status
Number and type of medical diagnoses
Cognitive and mental status
Sensory function
Social, economic, environmental factors
Functional/overall health status
An Important Distinction

Disease Severity
Case-mix or co-morbidity

What is the difference?

Disease Severity v. Case Mix
70
60
50
disease 40
severity 30
20
10
0
Patient A
Patient B
HTN
DM
COPD
case-mix
Arthritis
Nesting or Clustering of
Observations
A threat to validity
Nesting/Clustering of Observations


Traditional methods of analysis assume
observations are independent: people who see
same MD are not!
By setting




Multicenter studies
By clinic
By clinician
Key issue is understanding SOURCE of variance in
the observed data


Within group (clinic)
Between groups (clinics)
Survival by OR Team
10 yr survival
60
50
40
OR 1
OR 2
OR 3
30
20
10
0
0
2
4
6
cases per month
8
10
Common Terminology

Intra-class Correlation Coefficient
(ICC)


The extent to which individuals within the
same group are more similar to each
other than they are to individuals in
different groups
The proportion of the true variation in the
outcome that can be attributed to
differences between the clusters
ICC Values




20 primary care clinics
30-32 patients with type 2 DM per
clinic
For process of care measures (foot
exam, labs ordered etc.) ICC = 0.32
For A1c values, ICC = 0.12
Question: Which is more dependent upon site of care:
processes of care or outcomes of care (A1c control)?
Sample Size Implication

Design Effect = 1 + (n-1)ICC
(n= number of subjects per cluster)

So if estimated sample size of 300 per
group for an intervention study, what
sample size would you need for 30
subjects per cluster (10 clinics) and
ICC of 0.12?
Assignment

Design a study:






Experimental
Cross-sectional
Prospective cohort
Case-Control
What factors you would measure to “risk
adjust” and how measure them?
How would you would adjust for
nesting/clustering of data?
Questions?
Common Research Design
Issues in Health Services
Research

Secondary Database Analysis II

Transcript Secondary Database Analysis II

Directory