Differences-in-Differences and A (Very) Brief Introduction
Download
Report
Transcript Differences-in-Differences and A (Very) Brief Introduction
Instrumental Variables
Methods of Economic
Investigation
Lecture 15
Last Time
Introduction to Instrumental Variables
Correlation with variable of interest
Exclusion restriction
Interpretation of IV with homogeneous
treatment effects
Gives us a Wald estimate
Nice/well-defined properties of OLS
Today’s Class
Uses for 2SLS
Experiments with compliance issues
Omitted Variable Bias
Heterogeneous Treatment Effects
LATE framework
Interpretation
Review of Instrumental Variables
Two characteristics
Instrument (Z) is correlated with (S)
Must be that S is always increasing (or always
decreasing)
If it changed signs, then the first stage prediction
wouldn’t work
Instrument (Z) is uncorrelated with other
determinants of the outcome (Y)
This means Z is uncorrelated with unobservables that
affect Y
The only way Z affects Y is through S
Steps to Estimate IV-1
Step 1: The Structural Equation
Y =ρS + η
Problems: S correlated with η
OLS estimates won’t recover causal effect of S
on Y
Step 2: Find an Instrument
Correlated with S
Uncorrelated with η (and so uncorrelated with
the unobservables)
Steps to Estimate IV-2
Step 3: Estimate the First Stage
S = πZ + ν
Can estimate this with OLS
Want to test to see if π is significant—will
return to this in the case of weak instruments
where α is close to zero
Step 4: Obtain the fitted values
Sˆ ˆZ
This is the component of S that is unrelated to
the error term in the structural equation
Steps to Estimate IV -3
Step 5: Estimate the Second Stage
Y Sˆ (ˆZ ) bZ
This is using the fitted value, i.e. the predicted value
of S given the instrument Z
The fitted value captures the component of S that is
uncorrelated with the error
If we want to recover β take the OLS estimate from
the second stage b and divide it by the coefficient
from the first stage α
1
ZY
bˆ N i i
Cov(YZ )
ˆ 1
Cov( SZ )
Z
S
i i
N
Various uses for IV
Goal: Average Effect of S on Y (ATE)
Omitted Variable
Non-Experimental
Experimental
Perfect
Compliance
Imperfect
Compliance
Matching
IV
Measurement Error
IV
Fixed
Effects
Diff-indiff
Perfect
Compliance
IV
Imperfect
Compliance
IV
Things to worry about
1.
2.
3.
Is my instrument really uncorrelated with
other determinants of the outcome?
How do I interpret my IV estimate? What
if I think there are heterogeneous
treatment effects?
How strong does my first stage have to
be for this to all work?
We’ll deal with each of these issues
Can we test the exclusion restriction
This is the assumption of the model and
often cannot be formally tested
The reduced form gives some information
on the reasonableness of the assumption
Knowing π and the OLS biased estimated of ρ
we might gut check how reasonable it is for
the only effect of Z to be through S
If we have multiple instruments, we can test
using the overidentification test
Overidentification
Model is overidentified if we have:
# instruments variables > # endogenous
variables
Models with exactly same number of instruments
as endogenous variables are just identified
If the model is overidentified we can test
the quality of the fit
Testing Model Fit
Suppose we have Q instruments and define
(this is our first stage RHS variable)
Z i [ X i ' z1i ,...,zQi ]'
Wi [ X i ' sˆi ]'and as before ' '
Define
The residuals from the second stage can be
defined as:
i () Yi 'Wi Yi [X i sˆi ]
2SLS Residual Terms
We assume that η is orthogonal to Z so that
E[Zi η(Γ)]=0
The sample analog of this is
1
Z i i () m N ()
N
In finite samples, this won’t be exactly zero
2SLS fits the value of Γ making this closest to
zero
This has an asymptotic distribution of
2
N mN () N (0, ) where E[Zi Zi 'i () ]
The Minimand
There is an underlying Method of Moments
way to illustrate this but we’ll ignore that
for now.
Basic idea is to minimize the quadratic
form of the vector mN(Γ)
The optimal weighting matrix to estimate
this is Λ-1 and then the equation to be
minimized is: J N ( g ) NmN ( gˆ )' 1mN ( gˆ )
The Overidentification Test
Intuition: is mn(g) close enough to zero
for us to believe that Z uncorrelated with
the error (other unobservable stuff)
Null hypothesis: E[ηZ]=0 distributed
Χ2(Q-1)
Can also test this directly
Estimate the just-identified version for the Q
instruments
Test that the estimate coefficients are
statistically indistinguishable
Next time
Issues with IV
Heterogeneity
Weak Instruments