Analytic Epidemiology - University of Nevada, Las Vegas

Download Report

Transcript Analytic Epidemiology - University of Nevada, Las Vegas

Analytic Epidemiology
Determining the
Etiology of Disease
Study Development Process
Descriptive Studies:
Data Collection and
Analysis
Analyze results and
retest
Model Building and
hypothesis
formulation
Analytic Studies for
Hypothesis testing
Causality and Causal
Relationships



Must have statistical significance
Association may be either positive or
negative (if positive, the association is
higher than expected; if negative the
association is lower than expected)
Must try to rule out “noise” (assuring the
comparison of “apples to apples” by
controlling confounding factors)
Artifactual or Spurious
Associations




A false or fictitious association can result
from chance occurrence or bias in the
study methods
Type 1error occurs from random
fluctuation
Through retesting, we can determine
“spurious relationships.
Non-causal associations take place when
a factor and disease are associated
indirectly
Causal Association
Strength of association
 Dose-Response Relationship
 Consistency of the Association
 Temporally Correct Association
 Specificity of the Association
 Coherence with Existing Information

Sources of Data
Primary data – information collected
directly by the researcher
 Secondary data – data that has
already been collected and stored
for analysis

Types of Surveys
Administrative surveys, medical
records, vital records and statistical
data
 Telephone surveys
 Self-administered surveys
 Personal interviews

Measurement Issues
Measurement is an attempt to
assign numbers to observations
according to a set of rules
 Variables can be categorical or
continuous
 Intent is to translate observations
into a system that allows
assessment of the hypothesis

Types of Categorical
Variables




Nominal variables – assigns name or
number purely on arbitrary basis (e.g.,
race, sex)
Ordinal variables – measures assigned
from (typically) a lesser to greater value
Interval variables – scale that assigns a
number to an observation based on a
constant unit of measurement
Ratio – assigns numbers to observations
to reflect a true point
Improving the Survey







How is measure administered?
Has it been used on similar situations
with success?
Is measure understandable by those
being surveyed?
Is sample accessible and identifiable?
Is special training required?
What is length of time in measurement?
Are results available in timely manner?
Reliability and Validity
Issues
Reliability – the extent to which a
measurement has stability and
homogeneity
 Validity – represents the precision to
which the measure is truly
measuring the phenomena being
measured (measure must be reliable
to be valid)

Reliability and Validity
Issues
Content Validity – the extent to
which the measure reflects the full
concept being studied
 Criterion Validity – assessed by
comparing the test measure of the
phenomenon

Sensitivity
Sensitivity (Se) – measures how
accurately the test identifies those with
the condition or trait, i.e., correctly
identifies or captures true positives
 High sensitivity is needed:

When early treatment is important
 When identification of every case is
important

Specificity
Specificity (Sp) – measures how
accurately the test identifies those
without the condition or trait, i.e.,
correctly identifies or excludes the true
negatives.
 High specificity is needed when:

Re-screening is impractical
 When reducing false positive is important

Factors to consider in setting
cutoffs
Cost of false positives v. false negatives
 Importance of capturing all cases
 Likelihood population will be re-screened
 Prevalence of the disease (Pe):
a. Low Pe requires high Sp, otherwise too
many false positives
b. High Pe requires high Se, otherwise too
many false negatives

Disease State
Screening
Test
Disease
Positive
True Positive
(TP)
Negative
False Neg.
(FN)
No Disease
False Positive
(FN)
A B
C D
True Neg.
(TN)
Determining SE and SP
Rates
SE = TP / (TP + FN)
 SP = Specificity = TN / (TN + FP)
 False Neg. Rate = 1 – SE
 False Pos. Rate = 1 - SP
 Positive Predictive Value

TP/(TP + FP)

Negative Predictive Value
TN/(TN + FN)
SE and PE Example

You need to test the validity of cervical
smears – pap smears – to determine the
presence of cancer of the cervix. Smears
were taken from 120 women known to
have cancer of the cervix and from 580
women who did not have cancer of the
cervix. In the laboratory, the smears
were read “blind” as positive or negative.
Of the total 700 smears, 200 were read
as positive, 110 of which came from the
proven cancer cases.
The Research Cycle
Theory
Empirical findings
Statistical Tests
Operational
hypothesis
Observations and
measurements
Types of Studies




Cross-sectional – prevalence rates that
may suggest association (good for
developing theory, but no causal
association)
Retrospective (Case-control) – good for
rare diseases and initial etiologic studies
Prospective (cohort, longitudinal, followup) – yields incidence rates and estimates
for risk. Better for causal association.
Experimental (intervention studies) –
strongest evidence for etiology
Considerations for Study
Design
Stage of hypothesis development
 Nature of disease
 Nature of Exposure
 Nature of study population
 Context of research

Cross-Sectional Studies
Single point in time (snapshot
studies)
 Risk factors and disease measured
at the same time
 Determines prevalence ratios

Cross-Sectional Study
Design
Exposed
Cases
NonCases
Sample
Population
Cases
NonExposed
NonCases
Advantages and Disadvantages of
Cross Sectional Studies
Advantages
 Gives general
description or scope
of problem
 Useful in health
service evaluation and
planning
 Baseline for
prospective study
 Identifies cases and
controls for
retrospective study
 Low-cost
Disadvantages
 No calculation of risk
 Temporal sequence is
unclear
 Not good for rare
diseases
 Selective survival can
lead to bias
 Selective recall can
lead to bias
 Cohort effect may be
misleading
Prospective Study Desgin
Disease free persons are classified on
exposure at beginning of follow-up period
then tracked to ascertain the occurrence
of disease.
Question of Study: Do persons with the
factor of interest develop or avoid the
disease more frequently than those
without the factor or exposure
Prospective Study Design
Exposure
+
Sample
Population
Cases
NonCases
Cases
Exposure NonCases
Prospective Study
Criteria
Obtain Incidence data
 Obtain the incidence among the
exposed A/A+B
 Obtain incidence among the nonexposed to determine relative risk
C/C+D
 Determine Relative Risk
[A/(A+B)]/[C/(C+D)]

Advantages and Disadvantages of
Prospective Studies
Advantages
 Provides good
assessment of
temporal sequence
 Evaluate before
onset of disease
and watch for
disease
Disadvantages
 Selection bias
 Loss to follow-up
 Expensive
Retrospective Study
Design
Subjects are selected on the basis of
disease status: either cases or controls
then classified on the basis of past
exposure
Question of Study: Do persons with the
outcome of interest (cases) have the
exposure characteristic (or history of
exposure) more frequently than those
without the outcomes (controls)
Retrospective Study Design
Exposure Positive
A
Exposure Negative
B
Exposure Positive
C
Exposure Negative
D
Cases
Controls
Retrospective Study Method




Compare the odds of exposure among the
cases with the odds of exposure among
the controls
Odds of Exposure Among Cases =
[A/(A+C)]/[C/A+C)] or A/C
Odds of Exposure Among Controls
=[B/(B+D)]/[D/B+D)] or B/D
Get Odds Ratio or odds of expose among
cases/Odds of exposure among controls
(A/C)/(B/D)
Advantages and Disadvantages of
Retrospective Studies
Advantages
 Less expensive than
cohort (retrospective)
Studies
 Quicker than cohort
 Can identify more
than one exposure
 Good for rare
diseases
 Well design leads to
good etiologic
investigation
Disadvantages
 Selective Survival
 Selective recall
 Temporal sequence
not as clear
 Not suited for rare
exposures
 Gives an indirect
measure of risk
 More susceptible to
bias
 Limited to single
outcome
Experimental Studies
Uses an intervention in which the
investigator manipulates a factor
and measures the outcome
 Elements of a complete experiment

Manipulation of data
 Use of a control group
 Ability to randomize subjects to
treatment groups

Advantages and Disadvantages of
Experimental Studies
Advantages
 Prospective direction
 Ability to randomize
subjects
 Temporal sequence of
cause and effect
 Can control
extraneous variables
 Best evidence of
causality
Disadvantages
 Contrive situation
 Impossible to
control human
behavior
 Ethical Constraints
 External validity
uncertain
 Expensive
Attributable Risk
The rate of disease in the exposed group
attributable to exposure.
 Relative risk measures the strength of the
association
 Attributable risk identifies risk of the
disease attributable to exposure or the
proportion of incidence in exposed group
attributable to exposure
Attributable Risk
Calculation


Begins with (Incidence in the Exposed
Group) - (Incidence in the non-exposed
group).
Search for the proportion of AR
Attributable Risk
Calculation
Incidence in the
Incidence in the
Exposed Group - Exposed Group
Incidence in the Exposed Group
Population Attributable Risk
Requirements
Incidence rate of disease among
those exposed to a trait or
characteristic
 Incidence rate of disease among
those not exposed to the trait or
characteristic
 The proportion of the population
that has the trait or characteristic

PAR Example
Incidence
% of
Inc. lung
lung cancer, smokers in
+ cancer, nonsmokers
population
smokers
% of nonsmokers in
pop.
EXAMPLE Using NV Rates
[(28.0/1000)(.32)] + [(17.0/1000) (.68)] = 20.5
Attributable Risk Example
Incidence, total
population
- Incidence, non-
exposed population
EXAMPLE Using NV Rates
(20.5/1000) -
(17.0/1000) = 3.5/1000
PAR
Incidence, total
population
- Incidence, nonexposed pop.
Incidence in total population
EXAMPLE Using NV Rates
[(20.5/1000)- (17.0/1000)]
20.5
= 3.5/20.5 =
17%
Intervention Comparisons
To demonstrate any therapeutic
effect uses a PLACEBO
 To demonstrate improved therapy
compare to CONVENTIONAL
TREATMENT
 To demonstrate the most effective
regimen compare DIFFERENT
REGIMENS

Blinding in Experimental
Studies




The importance of blinding depends on
the needed outcome. Less important if
the outcome is clear.
Non-blinded – both subject & investigator
know the treatment allocation
Single-blinded – investigator knows,
subject does not know
Double-blinded – neither investigator and
subject
Sources of Bias
During selection of participants
 Absence of blinding allocation can
lead to differential classification
 Other sources of miscalculation
 Withdrawals, ineligible sources, loss
to follow-up
 Premature termination

Selection Bias



Cases and controls, or exposed and nonexposed individuals were selected is such
that an apparent association is observed even if there is no association.
Biased selection - taking from a pool in
which we know the risk is higher is
selection bias.
Small sample size or small response size
Information Bias
Methods of information
about the subjects in
the study are
inadequate and
results show
information gathered
regarding exposures
and/or disease is
incorrect.






Reporting bias
Abstracting records
Bias in interviewing
Bias from surrogate
interviews
Surveillance bias
Recall bias
Other Issues

Confounding Variables




To prove that Factor A is a result of disease B,
we say that a third factor, Factor X is a
Confounder if the following is true:
Factor X is a known risk factor for Disease B.
Factor X is associated with Factor A bit is not
a result of Factor A.
Interactions