The Value-Line Dow-Jones Model: Does It Have Predictive Content? Tom Fomby and Limin Lin Department of Economics SMU, Dallas, TX ISF 2004

Download Report

Transcript The Value-Line Dow-Jones Model: Does It Have Predictive Content? Tom Fomby and Limin Lin Department of Economics SMU, Dallas, TX ISF 2004

The Value-Line Dow-Jones Model:
Does It Have Predictive Content?
Tom Fomby and Limin Lin
Department of Economics
SMU, Dallas, TX
ISF 2004
A USEFUL RULE OF THUMB
• “Data Mining” by Michael Lovell (1983,
RES) The Lovell Pretesting Rule for
Coefficient Significance
• Start MLR with c candidate variables.
Use A Best Subset Method to obtain
A MLR with k final variables
P-Value(actual) = (c/k)*P-Value(stated)
A MLR PREDICTION
RULE OF THUMB
• “On the Usefulness of Macroeconomic Forecasts
as Inputs to Forecasting Models” Richard Ashley
Jo. of Forecasting. 1983
• Var(x(hat))/Var(x) versus 1
• Ratio greater than one, x generally not useful
• Ratio less than one, x possibly useful
• It seems a lot of practitioners ignore this rule at
their peril
The Value-Line Dow Jones Stock
Evaluation Model
 Regression model used by the Value-Line Corporation in
its end-of-year report (Value Line Investment Survey) to
provide its readers a forecast range for the Dow-Jones
Index in the coming years. (Model builder: Samuel
Eisenstadt)
1
2
3
 EPt   DPt 
 BYt 
DJ t   DJ t 1  
 
 

EP
DP
BY
 t 1   t 1 
 t 1 
DJ — Dow Jones Industrial Average,
EP —Earnings Per Share on the Dow Jones,
DP — Dividends Per Share on Dow Jones, and
BY — Moody’s AAA Corporate Bond Yield
 logarithm transformation linear form:
 ln( DJt )  0  1 ln( EPt )  2 ln( DPt )  3 ln( BYt )
Motivation
 No evaluation of the model in existing literature, although
the model is in use for over twenty years and possibly by
millions of readers who may have made decisions upon
forecasting results from the model. It would be interesting
and useful to see how precise and reliable these forecasts
are.
 Arguments in the literature about the forecasting
competence of regression model vs. univariate models, eg.
Ashley (1983). Accuracy of the model depends on the
accuracy of the forecasts of the independent variables. Are
the independent variables making the forecast better or
worse?
Outline of Presentation
Data
Stability Analysis
Out-of-Sample Forecast Evaluation
(Predictive Content of Input Variables)
Conclusions
Data
 Annual observations (1920-2002) on
•
•
•
•
DJ: Dow Jones Industrial Average, annual averages
EP: Earnings Per Share on the Dow (data point 1932 adjusted
for convenience of log transformation)
DP: Dividends Per Share on the Dow
BY: Moody’s AAA Corporate Bond Yield
 Data source: “Long Term Perspective” chart of the Dow
Jones Industrial Average, 1920-2002, published by the
Value Line Publishing, Inc. in Value Line Investment
Survey
 Logarithm transformation used to obtain linear regression
 Comparisons are made among forecasts of DJ
Dow Jones Industrial Average, Annual Average 1920-2002
12000
10000
8000
6000
DJ
In-Sample
(1920-1972)
Holdout Sample
(1973-2002)
4000
2000
Year
2001
1998
1995
1992
1989
1986
1983
1980
1977
1974
1971
1968
1965
1962
1959
1956
1953
1950
1947
1944
1941
1938
1935
1932
1929
1926
1923
1920
0
Earnings Per Share and Dividends Per Share on Dow Jones Industrial Average,
1920-2002
600
500
400
EP
300
DP
200
100
0
1920
1925
1930
1935
1940
1945
1950
1955
1960
Year
1965
1970
1975
1980
1985
1990
1995
2000
Moody's AAA Corporate Bond Yield, 1920-2002
16
14
12
10
% 8
BY
6
4
2
0
1920
1925
1930
1935
1940
1945
1950
1955
1960
Year
1965
1970
1975
1980
1985
1990
1995
2000
Stability Analysis for VLDJ Model:
Recursive Coefficients Diagrams
.20
.5
.15
.4
.10
.3
.05
.2
.00
.1
-.05
.0
-.10
-.1
-.15
-.2
1940
1950
1960
1970
1980
Recursive C(1) Estimates
1990
2000
1940
± 2 S.E.
1950
1960
1970
1980
Recursive C(2) Estimates
1.6
1990
2000
± 2 S.E.
3
2
1.2
1
0.8
0
-1
0.4
-2
0.0
-3
-0.4
-4
1940
1950
1960
1970
1980
Recursive C(3) Estimates
1990
± 2 S.E.
2000
1940
1950
1960
1970
1980
Recursive C(4) Estimates
1990
2000
± 2 S.E.
As reported in end of year ValueLine Investment Survey, coefficients are estimated as follows:
2002: (1.030, 0.210, 0.350, -0.413);
1999: (1.034, 0.217, 0.332, -0.468);
2001: (1.032, 0.218, 0.336, -0.463);
1998: (1.032, 0.216, 0.335, -0.473), and so on.
2000: (1.033, 0.214, 0.340, -0.480);
Stability Analysis for VLDJ Model:
CUSUM and CUSUMSQ Test Results
30
1.2
20
1.0
0.8
10
0.6
0
0.4
-10
0.2
-20
0.0
-30
30
40
50
60
70
80
90
00
-0.2
30
CUSUM
40
50
60
70
CUSUM of Squares
The CUSUM test is based on the statistic:
Wt 
Where
t
90
00

r
t
T
Wt  (   ) /(  r2 )
/s
is recursive residual defined as
5% Significance
The CUSUMSQ test is based on the statistic:
t
r  k 1
80
5% Significance
r  k 1
t 
2
r
r  k 1
( yt  xt ' b)
1  x '( X
t
' X t ) xt 
1
t
S is the standard error of the regression fitted to all T sample points.
1/ 2
Test for Structural Change of Unknown Timing:
Wald Test Sequence as a Function of Break Date
Andrews (1993, 2003) critical values
The Models for “DLDJ”
(specified using in-sample data only)
 Transfer function model (in same form as the Value-Line
Model): DLDJ at time t is a function of DLEP, DLDP and
DLBY at time t ; where
DLEP ~ MA(2) , DLDP ~ MA(1) and DLBY ~ AR (1)
 Box-Jenkins univariate model: DLDJ ~ MA(1).
 Note: Transform Predictions for DLDJ to DJ in two steps:
ˆ  LDJ  DLDJ
ˆ
ˆ
Step 1:
LDJ
 DLDJ
t ,h
t
t ,1 
t  h 1,1
^
Step 2:

2
ˆ
ˆ
DJt ,h  exp LDJt ,h  0.5 h

Ex-Ante Forecast Accuracy—
Transfer Function vs. Box-Jenkins
(Imperfect Foresight)
Forecast Horizon
Horizon1
Horizon2
Horizon3
Horizon4
Horizon5
Horizon6
RMSFE
TF
605.30
1,046.46
1,405.36
1,645.72
1,884.64
2,090.61
Increase in RMSE
(H6-H1)/H1
245.38%
BJ
528.40
999.61
1,355.96
1,470.20
1,719.23
1,870.55
254.00%
Accuracy Ranking
TF
BJ
2
1
2
1
2
1
2
1
2
1
2
1
Usefulness of Explanatory Variables
in the Transfer Function Model—
MSFE( xˆt ) / VAR( xt )
Forecast Horizon Explanatory Variables in the TF
DLEP
DLDP
Horizon1
0.90621
1.04771
Horizon2
1.06544
1.14441
Horizon3
1.05522
1.13803
Horizon4
1.02157
1.15946
Horizon5
1.00511
1.18731
Horizon6
0.96534
1.22971
Ashley(1983 )
1 82
MSFE ( xˆt )   ( xt 1  xˆt ,1 )2
30 t 52
1 83
2
VAR( xt )    xt  x 
30 t 53
Model
DLBY
0.95627
1.11007
1.03032
0.96935
0.97723
1.00496
Forecast Accuracy--RMSFE: Assuming
Perfect Foresight for Leading Indicators in
Transfer Function Model
Forecast Horizon TF-Perfect
TF-Imperfect
RMSFE RMSFE Disadvantage*
Horizon1
607
605
99.69%
Horizon2
980
1,046
106.73%
Horizon3
1,175
1,405
119.59%
Horizon4
1,295
1,646
127.04%
Horizon5
1,490
1,885
126.46%
Horizon6
1,575
2,091
132.77%
RMSFE
528
1,000
1,356
1,470
1,719
1,871
BJ
Accuracy Ranking
Disadvantage* TF-Perfect BJ
87.02%
2
1
101.95%
1
2
115.38%
1
2
113.49%
1
2
115.36%
1
2
118.80%
1
2
*Disadvantage: Loss of forecast accuracy relative to TF-Perfect
Value Line Forecasts vs. TF and BJ Forecasts
Year
actual DJ
1983
1984
1985
1986
1987
1988
1989
1990
1991
1992
1993
1994
1995
1996
1997
1998
1999
2000
2001
2002
MAFE*
RMSFE
1190
1178
1330
1797
2264
2062
2510
2670
2933
3282
3565
3735
4494
5740
7448
8631
10482
10731
10209
9214
ValueLine
DJ Forecast
1180
1390
1290
1435
2035
2425
2190
2710
2940
3445
N/A
N/A
N/A
5110
6415
8400
9800
11800
11800
10900
510
727
1-step-ahead
TF
BJ
972
906
1,335
1,368
1,230
1,202
1,421
1,451
2,002
2,034
2,355
2,497
2,107
2,086
2,645
2,819
2,779
2,817
3,234
3,172
3,647
3,546
3,785
3,824
3,872
3,975
4,674
4,977
6,001
6,412
7,757
8,368
9,327
9,396
11,056
11,681
11,178
11,327
11,096
10,700
593
555
807
704
2-step-ahead
TF
BJ
1,020
1,051
1,069
964
1,414
1,465
1,286
1,283
1,517
1,550
2,149
2,185
2,455
2,689
2,175
2,239
2,780
3,035
2,956
3,030
3,590
3,414
3,844
3,819
4,055
4,118
4,029
4,278
4,896
5,368
6,322
6,935
8,201
9,076
10,022
10,197
11,701
12,701
11,922
12,297
985
928
1,386
1,322
*The MAFE and RMSFE are computed based on years 1983-2002 except 1993-1995
Combination Forecasts of TF and BJ
 Simple Average (CF1)
 Nelson Combination (CF2)
 Granger-Ramanathan Combination (CF3)
 Fair-Shiller Combination (CF4)
 Note: We apply dynamic weights
Forecast Accuracy (RMSFE)—
Box-Jenkins vs. Combinations
RMSFE
Horizon1
Horizon2
Horizon3
Horizon4
Horizon5
Horizon6
(H6-H1)/H1
rank in increase
BJ
528
1,000
1,356
1,470
1,719
1,871
254%
1
CF1
559
1,011
1,368
1,549
1,790
1,973
253%
2
CF2
711
1,459
1,964
2,245
2,321
2,373
234%
3
CF3
627
1,134
1,656
1,394
1,548
1,279
104%
5
CF4
649
1,058
1,291
1,288
1,569
1,381
113%
4
RMSFE Ranking
Horizon1
Horizon2
Horizon3
Horizon4
Horizon5
Horizon6
Overall Score
Overall Ranking
BJ
1
1
2
3
3
3
2.17
1
CF1
2
2
3
4
4
4
3.17
4
CF2
5
5
5
5
5
5
5.00
5
CF3
3
4
4
2
1
1
2.50
3
CF4
4
3
1
1
2
2
2.17
1
Ways of Combating Weak Input Variables
 Drop input variables that don’t satisfy Ashley’s Criterion
(Forecast could have bias but less variance)
 Use improved input variables: Combination of sample
mean and forecasts of input variable
-- Simple average
-- Ashley (1985) combination
Forecast Accuracy—
Dropping Inadequate Input Variables
TF
Forecast Horizon Input Variables
Horizon1 DLEP and DLBY
Horizon2
N/A
Horizon3
N/A
Horizon4
DLBY
Horizon5
DLBY
Horizon6
DLEP
MAE
RMSE
394.56 633.69
N/A
N/A
N/A
N/A
917.41 1,535.99
1,118.73 1,755.99
1,426.34 2,139.76
Compare with BJ
MAE RMSE
351.91 528.4
604.64 999.61
764.41 1,355.96
895.69 1,470.20
1,092.55 1,719.23
1,198.63 1,870.55
Forecast Accuracy—
Input Variables From Combination Forecasts
Forecast
Leading Indicators:
Leading Indicators:
Compare with
Leading Indicators:
Horizon
Simple Average
MAFE RMSFE
376.43
596.93
641.95 1,058.13
824.00 1,385.89
997.90 1,616.70
1,241.54 1,912.19
1,452.05 2,159.12
Ashley Combination
MAFE RMSFE
389.28
611.46
721.76 1,112.25
834.19 1,386.82
995.05 1,596.70
1,261.50 1,886.64
1,458.71 2,129.24
ARIMA
MAFE RMSFE
378.85
605.30
630.54 1,046.46
822.84 1,405.36
998.43 1,645.72
1,189.42 1,884.64
1,385.33 2,090.61
Horizon1
Horizon2
Horizon3
Horizon4
Horizon5
Horizon6
Conclusions
 In the absence of perfect foresight, TF (Value Line) forecasts are less
accurate than the BJ benchmark forecasts for any forecast horizons.
 Ashley (1983) criterion shows that the leading indicators are very
noisy and inhibit ex ante forecasting accuracy of TF model.
 If future values of leading indicator variables are assumed known,
(perfect foresight), TF forecasts improve considerably--beat the BJ
forecast for 2-6 step-ahead forecast horizons, but do not for the 1-stepahead forecast horizon.
Conclusion (cont.)
 With respect to Ex Ante combination forecasting, BJ forecasts perform
better for short horizons and combinations of the TF and BJ are best
for longer horizons.
 For Ex Ante forecasts, differences in accuracy between TF forecasts
and the most accurate forecasts are not statistically significant. Ashley
(2003)
 Dynamic Combination forecasts perform better than combinations with
fixed weights.
 Dropping inadequate input variables did not improve forecast
accuracy. Using combination forecasts for the input variables only
improved the forecast accuracy of some horizons.
Conclusion (cont.)
 Evidently, the Value Line personnel have been pretty astute with
respect to choosing future values of the independent variables of their
model. Their published 1-step-ahead forecasts have smaller MAFE
than the ex ante TF model and the BJ model. With respect to the
RMSFE, however, the BJ model provides a more accurate 1-stepahead-forecast.
 Remember forecasting accuracy is only one way to evaluate the VLDJ
model. Irrespective of its forecasting powers, it should be recognized
that the VLDJ model is potentially quite useful for examining “what
if” scenarios and understanding historical causal factors in the stock
market.
 It would be interesting to compare competing models based on interval
forecast accuracy and density forecast accuracy.
Thank you!
References
• Andrews, D. W. K. (1993): “Tests for Parameter Instability and
Structural Change with Unknown Change Point,” Econometrica, 61,
821-856.
• Andrews, D. W. K. (2003): “Tests for Parameter Instability and
Structural Change with Unknown Change Point: A Corrigendum,”
Econometrica, 71 (1), 395-397.
• Ashley, R. (1983): “On the Usefulness of Macroeconomic Forecasts as
Inputs to Forecasting Models,” Journal of Forecasting, 2, 211-223.
• Ashley, R. (2003): “Statistically Significant Forecasting
Improvements: How Much Out-of-Sample Data Is Likely Necessary?”
International Journal of Forecasting, 19(2), 229-239.
• Bai, J. (1997): “Estimation of A Change Point in Multiple Regression
Models,” Review of Economics and Statistics, 79 (4), 551-563.
References (cont.)
• Brown, R. L., J. Durbin, and J. M. Evans (1975): "Techniques for
Testing the Constancy of Regression Relationships Over Time,"
Journal of the Royal Statistical Society, Series B, 37, 149-192.
• Diebold, F. X. and R. S. Mariano (1995): “Comparing Predictive
Accuracy,” Journal of Business and Economic Statistics, 13 (3), 253263.
• Fair, R. C. and R. J. Shiller (1990): “Comparing Information in
Forecasts from Econometric Models,” American Economic Review, 80
(3), 375-389.
• Nelson, C. R. (1972): “The Prediction Performance of the FRB-MITPENN Model of the U.S. Economy,” American Economic Review, 62
(5), 902-917.
Combinations of the TF and BJ models
• Naïve combination: simple average (weight=0.5)
CF1t ,h  0.5BJt ,h  0.5TFt ,h
In-sample
Out-of-sample
(obs. 1-53)
(obs. 54-83)
Combinations of the TF and BJ models
(dynamic weights applied)
• Dynamic Nelson combination (weights sum to 1)
CF 2t ,h  ˆt ,h BJt ,h  (1 ˆt ,h )TFt ,h
where weight is obtained from LS regression
(DJt h  TFt ,h )  t ,h (BJt ,h  TFt ,h )  t
15 obs.
Training
Test Data
Validation
(Out-of-sample)
15 obs.
Validation
…
Combinations of the TF and BJ models
(dynamic weights applied)
• Dynamic Granger-Ramanathan combination (weights
obtained from unrestricted regression)
CF 3t ,h  ˆt ,h  ˆt ,h BJt ,h  ˆt ,hTFt ,h
where weights are obtained from regression
DJt h  t ,h  t ,h BJt ,h   t ,hTFt ,h  t
• Dynamic Fair and Shiller Combination
CF 4t ,h  DJ t   t ,h  t ,h  BJ t ,h  DJ t    t ,h TFt ,h  DJ t 
where weights are obtained from regression
DJ t  h  DJ t   t ,h  t ,h  BJ t ,h  DJ t    t ,h TFt ,h  DJ t    t
Data --LDJ
Dow Jones Industrial Average, Annual Average in
Logarithm
10
9
8
7
6
LDJ
5
4
3
2
1
Year
2000
1996
1992
1988
1984
1980
1976
1972
1968
1964
1960
1956
1952
1948
1944
1940
1936
1932
1928
1924
1920
0
Year
2000
1996
1992
1988
1984
1980
1976
1972
1968
1964
1960
1956
1952
1948
1944
1940
1936
1932
1928
1924
1920
Data --LEP
Earnings Per Share on the Dow in Logarithm
7
6
5
4
LEP
3
2
1
0
Year
2000
1996
1992
1988
1984
1980
1976
1972
1968
1964
1960
1956
1952
1948
1944
1940
1936
1932
1928
1924
1920
Data --LDP
Dividends Per Share on the Dow in Logarithm
6
5
4
3
LDP
2
1
0
Year
2000
1996
1992
1988
1984
1980
1976
1972
1968
1964
1960
1956
1952
1948
1944
1940
1936
1932
1928
1924
1920
Data --LBY
Moody's AAA Bond Yields in Logarithm
3
2.5
2
1.5
LBY
1
0.5
0
Figure 12. First Difference of Log Dow Jones Industrial Average
0.4
0.2
-0.4
DLDJ
-0.6
-0.8
-1
Year
2001
1997
1993
1989
1985
1981
1977
1973
1969
1965
1961
1957
1953
1949
1945
1941
1937
1933
1929
1925
-0.2
1921
0
0
-1
-2
-3
-4
-5
Year
2001
1997
2
1993
1989
1985
1981
1977
1973
1969
1965
1961
1957
1953
1949
1945
1941
1937
1933
1929
1925
1921
Figure 13. First Difference of Log Earnings Per Share
4
3
DLEP
1
Figure 14. First Difference of Log Dividends Per Share
0.6
DLDP
0.4
0.2
-0.4
-0.6
-0.8
Year
2001
1997
1993
1989
1985
1981
1977
1973
1969
1965
1961
1957
1953
1949
1945
1941
1937
1933
1929
1925
-0.2
1921
0
Figure 15. First Difference of Log AAA Bond Yield
0.25
0.2
0.15
DLBY
0.1
-0.15
-0.2
-0.25
-0.3
Year
2001
1997
1993
1989
1985
1981
1977
1973
1969
1965
1961
1957
1953
1949
1945
1941
1937
1933
1929
1925
-0.05
-0.1
1921
0.05
0