Exponential smoothing: The state of the art Everette S

Download Report

Transcript Exponential smoothing: The state of the art Everette S

Exponential smoothing:
The state of the art – Part II
Everette S. Gardner, Jr.
1
Exponential smoothing:
The state of the art – Part II







2
History
Methods
Properties
Method selection
Model-fitting
Inventory control
Conclusions
Timeline of Operations Research (Gass,
2002)
1654
1733
1763
1788
1795
1826
1907
1909
1936
1941
1942
1943
1944
1944
3
Expected value, B. Pascal
Normal distribution, A. de Moivre
Bayes Rule, T. Bayes
Lagrangian multipliers, J. Lagrange
Method of Least Squares, C. Gauss, A. Legendre
Solution of linear equations, C. Gauss
Markov chains, A. Markov
Queuing theory, A. Erlang
The term OR first used in British military applications
Transportation model, F. Hitchcock
U.K. Naval Operational Research, P. Blackett
Neural networks, W. McCulloch, W. Pitts
Game theory, J. von Neumann, O. Morgenstern
Exponential smoothing, R. Brown
Exponential smoothing at work
“A depth charge has a
magnificent laxative
effect on a submariner.”
Lt. Sheldon H. Kinney,
Commander,
USS Bronstein (DE 189)
4
Forecast Profiles
N
None
N
None
A
Additive
DA
Damped Additive
M
Multiplicative
DM
Damped Multiplicative
5
A
Additive
M
Multiplicative
Damped multiplicative trends (Taylor,
1.00 0.95
0.90
2002)
4,000
Damping
paramete
r
3,000
2,000
1,000
6
0.85
Variations on the standard methods










7
Multivariate series (Pfefferman & Allen, 1989)
Missing or irregular observations (Wright,1986)
Irregular update intervals (Johnston, 1993)
Planned discontinuities (Williams & Miller, 1999)
Combined level/seasonal component (Snyder & Shami, 2001)
Multiple seasonal cycles (Taylor, 2003)
Fixed drift (Hyndman & Billah, 2003)
Smooth transition exponential smoothing (Taylor, 2004)
Renormalized seasonals (Archibald & Koehler, 2003)
SSOE state-space equivalent methods (Hyndman et al., 2002)
Smoothing with a fixed drift
(Hyndman & Billah, 2003)

Equivalent to the “Theta method”?
(Assimakopoulos and Nikolopoulos, 2000)

How to do it




When to do it

8
Set drift equal to half the slope of a regression on time
Then add a fixed drift to simple smoothing, or
Set the trend parameter to zero in Holt’s linear trend
Unknown
Adaptive simple smoothing (Taylor,
2004)
 Smooth transition exponential smoothing (STES)
is the only adaptive method to demonstrate
credible improved forecast accuracy
9

The adaptive parameter changes according to a
logistic function of the errors

Model-fitting is necessary
Renormalization of seasonals

Additive (Lawton, 1998)

Without renormalization



Renormalization of seasonals alone


Forecasts are biased unless renormalization is done
every period
Multiplicative (Archibald & Koehler, 2003)

10
Level and seasonals are biased
Trend and forecasts are unbiased
Competing renormalization methods give forecasts
different from each other and from unnormalized
forecasts
Archibald & Koehler (2003) solution
11

Additive and multiplicative renormalization
equations that give the same forecasts as
standard equations

Cumulative renormalization correction factors for
those who wish to keep the standard equations
Continental Airlines Domestic Yields
0.15
0.14
0.13
0.12
Model
Restarted
0.11
0.10
Jan-00
12
Jan-01
Jan-02
Jan-03
Jan-04
Jan-05
Standard vs. state-space methods

Trend damping



Multiplicative seasonality



Standard: Seasonal component depends on level
State-space: Independent components
Model fitting


13
Standard: Immediate
State-space: Starting at 2 steps ahead
Standard: Minimize squared errors
State-space: Minimize squared relative errors if
multiplicative errors are assumed.
Properties



14
Equivalent models
Prediction intervals
Robustness
Equivalent models

Linear methods





All methods

15
ARIMA
DLS regression
Kernel regression (Gijbels et al.,1999; Taylor, 2004)
MSOE state-space models (Harvey, 1984)
SSOE state-space models (Ord et al.,1997)
Analytical prediction intervals

Options



Empirical evidence

16
SSOE models (Hyndman et al., 2005)
Model-free (Chatfield & Yar, 1991)
None
Empirical prediction intervals

Options




Chebyshev distribution (fitted errors) (Gardner, 1988)
Quantile regression (fitted errors) (Taylor & Bunn, 1999)
Parametric bootstrap (Snyder et al., 2002)
Simulation from assumed model (Bowerman, O’Connell,
& Koehler, 2005)

Empirical evidence

17
Limited, but encouraging
Robustness

Many equivalent models for each method
(Chatfield et al., 2001; Koehler et al., 2001)

Simple ES performs well in many series that
are not ARIMA (0,1,1) (Cogger,1973)

Aggregated series can often be approximated
by ARIMA (0,1,1) (Rosanna & Seater, 1995)
18
Robustness (continued)

Exponentially declining weights are robust
(Muth, 1960; Satchell & Timmerman, 1995)

Additive seasonal methods are not sensitive
to the generating process (Chen,1997)

The damped trend includes numerous
special cases (Gardner & McKenzie,1988)
19
Automatic forecasting with the damped additive
trend
 = .84
 = .38
20
 = 1.00
Summary of 66 empirical studies,
1985-2005





21
Seasonal methods rarely used
Damped trend rarely used
Multiplicative trend never used
Little attention to method selection
But exponential smoothing was robust,
performing well in at least 58 studies
Method selection






22
Benchmarking
Time series characteristics
Expert systems
Information criteria
Operational benefits
Identification vs. selection
Benchmarking in method selection



23
Methods should be compared to reasonable
alternatives
Competing methods should use exactly the
same information
Forecast comparisons should be genuinely
out of sample
Method selection: Time series
characteristics

Variances of differences




(Shah,1997)
Considered only simple smoothing and a linear trend
Should be tested with an exponential smoothing
framework
Regression-based performance index


24
Seemed a good idea at the time
Discriminant analysis

(Gardner & McKenzie,1988)
(Meade, 2000)
Considered every feasible time series model
Should be tested with an exponential smoothing
framework
Method selection: Expert systems

Rule-based forecasting




Original version (Collopy & Armstrong, 1992)
Automatic version (Vokurka et al., 1996)
Streamlined version (Adya et al., 2001)
Other rule-induction systems
(Arinze,1994; Flores & Pearce, 2000)

25
Expert systems are no better than aggregate
selection of the damped trend alone (Gardner, 1999)
Method selection: AIC
Damped trend vs. state-space models selected by AIC:
Average of all forecast horizons
Damped trend
20
State-space
18
16
14
12
10
8
111
1,001 M3 Ann. M3 Qtr. M3 Mon.
MAPE
26
Asymmetric MAPE
Method selection:
Empirical information criteria (EIC)
27

Strategy: Penalize the likelihood by linear
and nonlinear functions of the number of
parameters (Billah et al., 2005)

Evaluation: EIC superior to other
information criteria, but results are not
benchmarked
Method selection: Operational benefits
28

Forecasting determines inventory costs,
service levels, and scheduling and
staffing efficiency.

Research is limited because a model of
the operating system is needed to project
performance measures.
Method selection: Operational benefits
(cont.)

Manufacturing (Adshead & Price, 1987)



U.S. Navy repair parts (Gardner, 1990)



29
Producer of industrial fasteners (£4 million
annual sales)
Costs: holding, stockout, overtime
50,000 inventory items
Tradeoffs: Backorder delays vs. investment
Savings: $30 million (7%) in investment
Average delay in filling backorders
50
Random walk
Backorder days
45
Linear trend
40
35
Simple smoothing
30
Damped trend
25
370
380
390
400
410
Inventory investment (millions)
30
420
430
Inventory analysis: Packaging materials for
snack-food manufacturer
$2,500,000
$2,000,000
Actual Inventory
from subjective
forecasts
$1,500,000
$1,000,000
$500,000
$0
Target maximum
inventory based on
damped trend
31
Month
Month
Monthly Usage
Method selection: Operational benefits
(cont.)

Electronics components (Flores et al., 1993)



RAF repair parts (Eaves & Kingsman, 2004)



32
967 inventory items
Costs: holding cost vs. margin on lost sales
11,203 inventory items
Tradeoffs: inventory investment vs. stockouts
Savings: £285 million (14%) in investment
Forecasting for inventory control:
Cumulative lead-time demand


33
SSOE models yield standard deviations of
cumulative lead-time demand (Snyder et al., 2004)
Differences from traditional expressions (such
as s Lead time ) are significant
Standard deviation multipliers, α = 0.30
Traditional
Correct
5
4
3
2
1
0
2
3
4
Lead time
34
5
6
Forecasting for inventory control:
Cumulative lead-time demand (cont.)

The parametric bootstrap (Snyder et al., 2002)
can estimate variances for:




35
Any seasonal model
Non-normal demands
Intermittent demands
Stochastic lead times
Forecasting for inventory control:
Intermittent demand

Croston’s method (Croston, 1972)
Mean demand

=
Smoothed nonzero demand
Smoothed inter-arrival time
Bias correction (Eaves & Kingsman, 2004;
Syntetos & Boylan, 2001, 2005)
Mean demand x (1 – α / 2)
36
Forecasting for inventory control:
Intermittent demand (continued)

There is no stochastic model for Croston’s
method (Shenstone & Hyndman, 2005)



37
Many questionable variance expressions in the
literature
The state-space model for intermittent series requires
a constant mean inter-arrival time (Snyder, 2002)
Why not aggregate the data to eliminate zeroes?
Progress in the state of the art, 19852005
 Analytical variances are available for most





38
methods through SSOE models.
Robust methods are available for multiplicative
trends and adaptive simple smoothing.
Croston’s method has been corrected for bias.
Confusion about renormalization of seasonals has
finally been resolved.
There has been little progress in method selection.
Much empirical work remains to be done.
Suggestions for research

Refine the state-space framework




Validate and compare method selection
procedures



39
Add the damped multiplicative trend
Damp all trends immediately
Test alternative method selection procedures
Information criteria – Benchmark the EIC
Discriminant analysis
Regression-based performance index
Suggestions for research (continued)

Develop guidelines for the following choices:






40
Damped additive vs. damped multiplicative trend
Fixed vs. adaptive parameters in simple smoothing
Fixed vs. smoothed trend in additive trend model
Standard vs. state-space seasonal components
Additive vs. multiplicative errors
Analytical vs. empirical prediction intervals
Conclusion
“The challenge for future research is to
establish some basis for choosing among
these and other approaches to time series
forecasting.” (Gardner,1985)
41