Chapter 7 - Ken Farr (GCSU)

Download Report

Transcript Chapter 7 - Ken Farr (GCSU)

Chapter 7

Specification: Choosing a Functional Form

Copyright © 2011 Pearson Addison-Wesley.

All rights reserved.

Slides by Niels-Hugo Blunch Washington and Lee University

Choosing a Functional Form

• After the independent variables are chosen, the next step is to choose the functional form of the relationship between the dependent variable and each of the independent variables. • KISS Principle • Let theory be your guide! Not the data!

7-1 © 2011 Pearson Addison-Wesley. All rights reserved.

The Use and Interpretation of the Constant Term

• An estimate of β 0 has at least

three components

: 1. the true β 0 2. the constant impact of any specification errors (an omitted variable, for example) 3. the mean of ε for the correctly specified equation (if not equal to zero) • Unfortunately, these components can’t be distinguished from one another because we can observe only β 0 , the sum of the three components • As a result of this, we usually

don’t interpret

the constant term • On the other hand, we should

not suppress

either, as illustrated by Figure 7.1 the constant term,

7-2 © 2011 Pearson Addison-Wesley. All rights reserved.

Figure 7.1 The Harmful Effect of Suppressing the Constant Term

© 2011 Pearson Addison-Wesley. All rights reserved.

7-3

Alternative Functional Forms

• • • An equation is

linear

in the

variables

and Y generates a

straight line

if plotting the function in terms of X For example, Equation 7.1: Y = β 0 + β 1 X + ε

is

linear in the variables but Equation 7.2: Y = β 0 + β 1 X 2 + ε is

not

linear in the variables (7.1) (7.2) Similarly, an equation is

linear

in the

coefficients

appear in their simplest form —they: only if the

coefficients

– are

not raised

to any powers (other than one) – are

not multiplied

or

divided

by other coefficients – do

not

themselves

include

some sort of

function

(like

logs

or

exponents

)

© 2011 Pearson Addison-Wesley. All rights reserved.

7-4

Alternative Functional Forms (cont.)

• For example, Equations 7.1 and 7.2

are linear

in the coefficients, while Equation 7:3: (7.3) is

not linear

in the coefficients • In fact, of

all possible equations

for a single explanatory variable,

only

functions of the general form: (7.4) are linear in the coefficients β 0 and β 1

7-5 © 2011 Pearson Addison-Wesley. All rights reserved.

Linear Form

• This is based on the assumption that the slope of the relationship between the independent variable and the dependent variable is constant: • For the linear case, the

elasticity

of Y with respect to X (the percentage change in the dependent variable caused by a 1-percent increase in the independent variable, holding the other variables in the equation constant) is:

© 2011 Pearson Addison-Wesley. All rights reserved.

7-6

What Is a Logarithm?

• • • • • If

e log

(a

constant

of x: equal to 2.71828) to the “

b

th power” produces x, then b is the b is the log of x to the base e if: e b = x Thus, a log (or logarithm) is the exponent to which a given base must be taken in order to produce a specific number While logs come in more than one variety, we’ll use only natural logs (logs to the base e) in this text The symbol for a natural log is “ln,” so ln(x) = b means that (2.71828) more simply, b = x or, ln(x) = b

means that

e b = x For example, since e 2 = (2.71828) 2 = 7.389, we can state that: ln(7.389) = 2

7-7 © 2011 Pearson Addison-Wesley. All rights reserved.

Double-Log Form

• Assume the following: Y i   0  X X e 1i 1  2i 2  i • Yields: ln Y i ln 0 1 ln X 1i  2 X 2i   i • Here, the natural log of Y is the dependent variable and the natural log of X is the independent variable: • In a double-log equation, an individual regression coefficient can be interpreted as an

elasticity

because: • Note that the

elasticities

of the model are

constant not

and the

slopes

are • This is in

contrast

to the

© 2011 Pearson Addison-Wesley. All rights reserved.

linear model

, in which the

slopes constant

but the

elasticities

are

not

are

7-8

Figure 7.2 Double-Log Functions

© 2011 Pearson Addison-Wesley. All rights reserved.

7-9

Semilog Form

• The

semilog

functional form is a variant of the double log equation in which some but not all of the variables (dependent and independent) are expressed in terms of their natural logs. • It can be on the

right-hand

side, as in: Y i = β 0 + β 1 lnX 1i + β 2 X 2i + ε i • Or it can be on the

left-hand

side, as in: lnY = β 0 + β 1 X 1 + β 2 X 2 + ε • Figure 7.3 illustrates these two different cases (7.7) (7.9)

© 2011 Pearson Addison-Wesley. All rights reserved.

7-10

Figure 7.3 Semilog Functions

© 2011 Pearson Addison-Wesley. All rights reserved.

7-11

Polynomial Form

Polynomial functional

forms express Y as a function of independent variables, some of which are raised to powers other than 1 • For example, in a

second-degree

polynomial (also called a quadratic) equation, at least one independent variable is

squared

: Y i = β 0 + β 1 X 1i + β 2 (X 1i ) 2 + β 3 X 2i + ε i • The

slope

of Y with respect to X 1 in Equation 7.10 is: (7.10) (7.11) • Note that the

slope

depends on the

level

of X 1

© 2011 Pearson Addison-Wesley. All rights reserved.

7-12

Figure 7.4 Polynomial Functions

© 2011 Pearson Addison-Wesley. All rights reserved.

7-13

Inverse Form

• The

inverse

functional form expresses Y as a function of the

reciprocal

(or

inverse

) of one or more of the independent variables (in this case, X 1 ): Y i = β 0 + β 1 (1/X 1i ) + β 2 X 2i + ε i (7.13) • So X 1

cannot equal zero

• This functional form is relevant when the impact of a particular independent variable is expected to approach zero as that independent variable approaches infinity • The slope with respect to X 1 is: (7.14) • The slopes for X 1 of β 1 fall into

two categories

(illustrated in Figure 7.5) , depending on the sign

© 2011 Pearson Addison-Wesley. All rights reserved.

7-14

Figure 7.5 Inverse Functions

© 2011 Pearson Addison-Wesley. All rights reserved.

7-15

Table 7.1 Summary of Alternative Functional Forms

© 2011 Pearson Addison-Wesley. All rights reserved.

7-16

Lagged Independent Variables

• Virtually all the regressions we’ve studied so far have been

“instantaneous”

in nature • In other words, they have included independent and dependent variables from the

same time period

, as in: Y t = β 0 + β 1 X 1t + β 2 X 2t + ε t (7.15) • Many econometric equations include one or more

lagged independent variables

the observation of X 1 like X 1t-1 where “t–1” indicates that is from the time period previous to time period t, as in the following equation: Y t = β 0 + β 1 X 1t-1 + β 2 X 2t + ε t

© 2011 Pearson Addison-Wesley. All rights reserved.

(7.16)

7-17

Using Dummy Variables

• A dummy variable is a variable that takes on the values of 0 or 1, depending on whether a condition for a qualitative attribute (such as gender) is met • These conditions take the

general form:

(7.18) • This is an example of an

intercept dummy

(as opposed to a

slope dummy

, which is discussed in Section 7.5) • Figure 7.6 illustrates the consequences of including an intercept dummy in a linear regression model

© 2011 Pearson Addison-Wesley. All rights reserved.

7-18

Figure 7.6 An Intercept Dummy

© 2011 Pearson Addison-Wesley. All rights reserved.

7-19

Slope Dummy Variables

• Contrary to the

intercept dummy

, which changed only the intercept (and not the slope), the

slope dummy

changes both the intercept and the slope • The

general form

of a

slope dummy equation

is: Y i = β 0 + β 1 X i + β 2 D i + β 3 X i D i + ε i • The slope depends on the value of D: When D = 0, ΔY/ΔX = β 1 When D = 1, ΔY/ΔX = (β 1 + β 3 ) • Graphical illustration of how this works in Figure 7.7

(7.20)

© 2011 Pearson Addison-Wesley. All rights reserved.

7-20

Figure 7.7 Slope and Intercept Dummies

© 2011 Pearson Addison-Wesley. All rights reserved.

7-21

Problems with Incorrect Functional Forms

• If functional forms are similar, and if theory does not specify exactly which form to use, there are at least two reasons why we should avoid using goodness of fit over the sample to determine which equation to use: 1.

Fits are difficult to compare if the dependent variable is transformed 2.

An incorrect function form may provide a reasonable fit

within the sample

have the potential to make large forecast errors when used

outside

but the range of

the sample

• The first of these is essentially due to the fact that when the dependent variable is transformed, the

total sum of squares

(TSS)

changes

as well • The second is essentially die to the fact that using an incorrect functional amounts to a specification error similar to the omitted variables bias discussed in Section 6.1

• This second case is illustrated in Figure 7.8

© 2011 Pearson Addison-Wesley. All rights reserved.

7-22

Figure 7.8a Incorrect Functional Forms Outside the Sample Range © 2011 Pearson Addison-Wesley. All rights reserved.

7-23

Figure 7.8b Incorrect Functional Forms Outside the Sample Range © 2011 Pearson Addison-Wesley. All rights reserved.

7-24

Key Terms from Chapter 7

• Elasticity • Double-log functional form • Semilog functional form • Polynomial functional form • Inverse functional form • Slope dummy • Natural log • Omitted condition • Interaction term • Linear in the variables • Linear in the coefficients

7-25 © 2011 Pearson Addison-Wesley. All rights reserved.