Transcript Slide 1
Research Methods for the
Social Sciences:
An Introductory Course
February 23rd, 2010 – Statistics
Sarah Nelson and Julia Braverman
Division on Addictions, Cambridge Health Alliance
Harvard Medical School
The Dissertation Effect – First
Year Project
19
17
15
13
11
9
7
5
3
1
Control
Intervention
M1
M2
M3
M4
The Dissertation Effect Dissertation
2.3
2.25
2.2
2.15
2.1
Control
Intervention
2.05
2
1.95
1.9
1.85
M1
M2
M3
M4
Overview
Descriptive statistics
– Central tendency
– Variability/Dispersion
Inferential statistics
– Hypothesis testing
– Test review
– Effect size and meta-analysis
Adapted from Dr. Braverman’s slides
Descriptive Statistics
Statistics used to summarize or describe
data
– Frequencies
60% of public high school students say they have
plagiarized papers.
– Mean
Men who are lost wait an average of 20 minutes
before giving up and asking for directions. Women
– 10 minutes.
– Range
Romantic love involves chemical changes in the
brain that last 12 – 18 months. After that, you and
your partner are on your own.
Adapted from Dr. Braverman’s slides
Data
Central tendency
Mean
– ΣX/N
– 740/11
– Mean = 67.3
Class A.
60, 80, 90, 80, 75, 90,
95, 30,70, 80, 70
Mode
– Most frequent value
– 80
N = 11.
Median - Center
value when data is
arranged in order
30 60 70 70 75 80 80 80 90 90 95
Dr. Braverman’s slides
Data
Central tendency
– ΣX/N
– 740/11
– Mean = 67.3
Class A.
60, 80, 90, 80, 75, 90,
95, 30,70, 80, 70
Mean
Mode
– Most frequent value
– 80
Median - Center
value when data is
arranged in order
30 60 70 70 75 80 80 80 90 90 95
Dr. Braverman’s slides
Data
Central tendency
– ΣX/N
– 740/11
– Mean = 67.3
Class A.
60, 80, 90, 80, 75, 90,
95, 30,70, 80, 70
Mean
Mode
– Most frequent value
– 80
Median - Center
value when data is
arranged in order
30 60 70 70 75 80 80 80 90 90 95
Dr. Braverman’s slides
Data
Central tendency
– ΣX/N
– 740/11
– Mean = 67.3
Class A.
60, 80, 90, 80, 75, 90,
95, 30,70, 80, 70
Mean
Mode
– Most frequent value
– 80
Median - Center
value when data is
arranged in order
30 60 70 70 75 80 80 80 90 90 95
Dr. Braverman’s slides
Data
Central tendency
– ΣX/N
– 740/11
– Mean = 67.3
Class A.
60, 80, 90, 80, 75, 90,
95, 30,70, 80, 70
Mean
Mode
– Most frequent value
– 80
Median - Center
value when data is
arranged in order
30 60 70 70 75 80 80 80 90 90 95
Dr. Braverman’s slides
Data
Central tendency
– ΣX/N
– 740/11
– Mean = 67.3
Class A.
60, 80, 90, 80, 75, 90,
95, 30,70, 80, 70
Mean
Mode
– Most frequent value
– 80
Median - Center
value when data is
arranged in order
30 60 70 70 75 80 80 80 90 90 95
Dr. Braverman’s slides
Data
Central tendency
– ΣX/N
– 740/11
– Mean = 67.3
Class A.
60, 80, 90, 80, 75, 90,
95, 30,70, 80, 70
Mean
Mode
– Most frequent value
– 80
Median - Center
value when data is
arranged in order
– Median = 80.
30 60 70 70 75 80 80 80 90 90 95
Dr. Braverman’s slides
Mean and Median
Mean
– Reflects ALL values
Dr. Braverman’s slides
Median
– No extreme values
– Exactly 50% above
and below.
Variability
Range
Standard deviation/dispersion
Dr. Braverman’s slides
Dispersion/standard deviation
Country A
–
–
–
–
–
–
–
15,000
20,000
50,000
60,000
80,000
90,000
100,000
Country B
–
–
–
–
–
–
–
40,000
40,000
50,000
60,000
70,000
70,000
80,000
Standard deviation –
Average distance of all scores from the average.
Dr. Braverman’s slides
Normal Distribution
f
r
e
q
u
e
n
c
y
Dr. Braverman’s slides
values
Normal Distribution
Symmetrical
Mean = Mode = Median
Bell-shaped
Most scores cluster around the mean.
Dr. Braverman’s slides
The normal distribution
The area under the normal curve represents
100% of the scores
A property of the normal curve is that
approx. 99% of scores fall within 3 standard
deviations of the mean
– Specifically
~34.13% within one SD in one direction
~68.26% within one SD
~95.44% within two SD
Dr. Braverman’s slides
Normal and z-scores
Dr. Braverman’s slides
Normal distribution of IQ
= 100
= 15
Mode = 100
Median = 100
Number of people
Frequency of IQ
30 40 50 60 70 80 90 100 110 120 130 140 150 160 170
IQ
Dr. Braverman’s slides
Central Tendency
Dr. Braverman’s slides
Inferential statistics
Drawing a conclusion about general
population based on a sample.
Dr. Braverman’s slides
A Quick Aside
Population computations:
– Often provided in text books for pedagogical reasons.
– Requires that you have every subject, i.e. you are not
making any inferences to a more general population
– Uses Greek letters:
Sample computations:
– Often presented as computational formulas
– Allows one to make inferences to a more general
population
– Uses Arabic letters: M S X
– Uses degrees of freedom
Dr. Braverman’s slides
Hypothesis Testing
9 Step procedure
–
–
–
–
–
–
–
–
–
State the hypotheses
Determine the nature of variables
Choose the appropriate test statistic
Set Type I error rates (alpha)
Determine your sample size
Collect data
Run appropriate statistical test
Calculate observed effect size
Make a decision and conclusions
Dr. Braverman’s slides
Hypotheses
Null hypothesis: H0
– States that nothing special is happening in
respect to some characteristic of the
underlying population
Alternative hypothesis H1
– The opposite of the null hypothesis, also
called the research hypothesis
Dr. Braverman’s slides
Example
Students participate in an SAT prep class
in English. The experimenter thinks that
this class will improve the students scored
compared to the national average of 500.
– Null: There will be no difference in the
performance of the participants in the prep
group. X- <= 0
– Alternative: Student in the prep group
perform better than the national average. X-
>0
Dr. Braverman’s slides
Rejecting Null hypothesis
With the given set of data, how likely (p)
is it that the null hypothesis is true?
α = .05/ .01/ .001 (Arbitrary setting)
If p < α
Reject H0
Dr. Braverman’s slides
Error
Type I error: Probability of rejecting a null
hypothesis when there is no effect
present. Alpha level, generally set to .05
Type II error: Probability of retaining a
false null, i.e., missing an effect. Beta
– Power is the opposite of Beta and is the
probability of detecting a present effect.
Power is useful for determining a sample size
Dr. Braverman’s slides
Significance of significance
1.
2.
What significance does NOT mean..
The effect is IMPORTANT
The effect is LARGE
Dr. Braverman’s slides
Effect size
Effect size = (mean of experimental group mean of control group)/standard deviation
Cohen’s d (can be bigger than 1)
–
–
–
–
Less than .1 trivial
.1-.3 small
.3-.5 moderate
Greater than .5 large
r - correlation coefficient
– <.2 - small
– .5 - large
Dr. Braverman’s slides
Power
The probability to reject null hypothesis
when it is, in fact, false.
Power is bigger if
– Effect size is bigger
– Sample size is bigger
– Alpha is bigger
Dr. Braverman’s slides
What Kind of Test to Use?
1.
Define your variables.
– Independent Variables:
what you manipulate
– Dependent Variables:
what you measure
Dr. Braverman’s slides
Numerical or Categorical
Numerical
– Values defined by numbers.
– You can calculate mean and standard
deviation.
Categorical
– Values defined by labels.
– You can calculate frequency (how many of
each category).
Dr. Braverman’s slides
Tests to Use if All Variables are
Numerical
Examples:
– How SAT is related to the average GPA during the senior year of
college.
– How attitudes toward George Bush related to the attitudes
toward abortion.
Test to use:
– Regression/Correlation analysis
Statistics: r
Dr. Braverman’s slides
Linear Correlation
Examining the relationship between height
and self esteem
Examining the relationship between
emotional intelligence and social skills.
Dr. Braverman’s slides
Linear Correlation
Positive (r > 0)
Negative (r < 0)
Help
Performance
Hours of sleep before the exam
Dr. Braverman’s slides
Number of bystanders
Dr. Braverman’s slides
r=?
Performance
r=0
Anxiety
1>r>0
r- correlation
A social scientist wishes to determine
whether their is a relationship between
the attractiveness scores (on a 100-point
scale) assigned to college students by a
panel of peers and their score on a paperand-pencil test of anxiety.
– Variable 1 (numerical): attractiveness
– Variable 2 (numerical): anxiety
Dr. Braverman’s slides
Tests to Use if Variables are
Mixed: Categorical + Numerical
Examples:
1.
2.
Do females experience more empathy toward a stranger than
males do?
Which religious group gives more support toward the
president.
Test to use:
– t-test or different kinds of ANOVA.
the number of groups and variables.
Dr. Braverman’s slides
Depends on
t-test for independent samples.
A school psychologist compares the
reading comprehension score of migrant
children who, as a result of random
assignment, are enrolled in either a
special bilingual reading program or a
traditional reading program.
– IV( categorical, 2 levels): Reading program
– DV (numerical): Reading score.
Dr. Braverman’s slides
t-test for matched samples.
To determine whether speed reading influences
reading comprehension, a researcher obtains
two reading comprehension scores for each
student in a group of high school students, once
before and once after training in speed reading.
– IV (categorical, 2 levels) : training
– DV (numerical): reading comprehension score
Dr. Braverman’s slides
One-way ANOVA
In a study of problem solving, a
researcher randomly assigns college
students to one of three groups: highstatus leader, equal-status leader or no
leader, and measure the amount of time
required to solve a complex puzzle.
– IV (categorical, 3 levels): type of leader
– DV (numerical): time
Dr. Braverman’s slides
2-way ANOVA
To determine whether cramming can increase GRE
scores a researcher randomly assigns college students to
either a specialized GRE test-taking workshop, a general
test-taking workshop, or a control (non-test-taking)
workshop. Furthermore, to check the effect of
scheduling, students are randomly assigned, in equal
number, to experience their workshop either during one
long marathon weekend or during weekly sessions.
– DV (numerical): GRE score
– IV (categorical, 3 levels): kind of workshop
– IV (categorical, 2 levels): session
Dr. Braverman’s slides
Tests to Use if There are No
Numerical Variables
Example:
– Is there a different frequency of rainy days in the four
seasons?
Test
to use:
– One variable
One-way Chi-square
– Two variables
Two-way Chi square
Dr. Braverman’s slides
One-Way Chi-Square.
A random sample of 90 college students
indicates whether they most desire love,
wealth, power, health, fame, or family
happiness.
– Variable 1 (categorical): desire.
Dr. Braverman’s slides
2-way Chi-Square
A social scientist cross-classifies the
responses of 100 randomly selected
people on the basis of gender and
whether or not they favor strong gun
control laws.
– Variable 1 (categorical, 2 levels): gender
– Variable 2 (categorical, 2 levels): opinion
toward gun control.
Dr. Braverman’s slides
Article 1: Meghany et al., 2009. Predictors of
resolution of aberrant drug behavior in chronic pain
patients treated in a structured opioid risk
management program
Research Question:
– For chronic pain patients prescribed opioids
who are at risk for developing addiction to
opioids, what predicts success in a program
designed to manage those risks?
This is a question about moderators of a
supposed treatment effect
Article 1: Meghany et al., 2009. Predictors of
resolution of aberrant drug behavior in chronic pain
patients treated in a structured opioid risk
management program
Sample:
– Consecutive referrals to the Opioid Renewal
Clinic for aberrant drug related behaviors
(ADRBs) over the course of two and a half years
(N = 195 [49% of the 401 total referred])
– All participants had chronic non-cancer-related
pain
Article 1: Meghany et al., 2009. Predictors of resolution of
aberrant drug behavior in chronic pain patients treated in a
structured opioid risk management program
Measures: Predictors
– Demographics: Age, Race, Gender, Marital
Status, Employment, Veteran Status/Impairment
– Pain: Primary Diagnosis, # of Pain Diagnoses
– Comorbidity: Medical and Psychiatric
Measures: Outcome
– Staying in the program vs. Discharge/Drop-out
Assumes resolution of ADRBs and negative urine
screens for illicit drugs
Article 1: Meghany et al., 2009. Predictors of
resolution of aberrant drug behavior in chronic pain
patients treated in a structured opioid risk
management program
Predictors are:
– Categorical
– Continuous
Outcome is:
– Categorical
Specifically, dichotomous
Article 1: Meghany et al., 2009. Predictors of resolution of
aberrant drug behavior in chronic pain patients treated in a
structured opioid risk management program
Analysis Plan
1. Descriptives
–
51% (106) did not complete the program
–
–
–
–
58% (61) discharged
19% (20) moved into addiction treatment
23% (25) self-discharged/dropped out
General tendencies on the predictors for the sample
2. Comparison of groups (89 vs. 106) on predictor
variables
–
–
No demographic differences
Successful completers had higher medical comorbidity and
more pain diagnoses, were more likely to have a history of
depression, and were less likely to have abused cocaine
Article 1: Meghany et al., 2009. Predictors of resolution of
aberrant drug behavior in chronic pain patients treated in a
structured opioid risk management program
Analysis Plan
3. Multivariate Test: Logistic Regression
– Point of multivariate test: Takes the correlation
between variables into account so you learn which
variables have unique effects
– Certain relationships disappear because of those
correlations and others emerge
– # of pain diagnoses and history of cocaine abuse
remained significant independent predictors; marital
status emerged as a protective factor
Resources
• Gonick & Smith (1993). The Cartoon Guide to
•
•
•
Statistics.
Grimm & Yarnold (1995). Reading and
Understanding Multivariate Statistics
Grimm & Yarnold (2000). Reading and
Understanding More Multivariate Statistics
Abelson (1995). Statistics as Principled Argument.