Transcript A range of methods for electrical consumption forecasting.
1 - -
A range of methods for electrical consumption forecasting Energy Systems Week April 22th: Xavier Brossat EDF/RD/Dpt OSIRIS
A range of methods for electrical consumption forecasting.
For Electricité de France the forecast of electricity consumption is a fundamental problem which has been studied for the last twenty years. It is necessary to be able to provide customers and at the same time, optimize the production at different horizons of time.
Results of operating models that use non linear regression or ARMAX methods are satisfying with a current accuracy of 1.5% for the forecast of the following day.
But, they have to be continually fitted to be adapted to some very difficult periods of time and to the change of consumption.
For a few years, due to the new competitive environment, the electrical load curve has become less regular. Its shape and level which depended essentially on climatic exogenous variables has become more affected by economical and ecological variables. The data is not always available and the time series used are often short.
So, we have tried to apply the following alternative methods to answer to problems like adaptivity, nonstationarity, parsimony, lack of data, necessity of forecast interval.
In this presentation we will display the operating models and those different classes of models which we applied to electrical consumption forecast. For each model we will present the method used, we will show some practical results
and we will discuss the benefits and drawbacks of it.(adaptive Kalmann, GAM, combining algoritms, KWF, Bayesian Methods, ..)
2 - -
Energy management
Supply-side uncertainties Investment decisions Fuel supplies System flexibilities Nuclear maintenance scheduling Long-term 20-50 years Medium-term 1-5 years Structuring of customer contracts Long-term demand Demand-side uncertainties G = D Stock management (H2O, Nuclear, large combustion plants, etc.) Daily generation plans Control and management of market risks THF, H2O maintenance Intra-day redeclarations Market arbitrages Short-term 1 hour – 1 day Load forecasts Load-shedding, nominations Strategy for the use of load-sheds and gas contracts
R&D skills and Energy Management
Management of market risks Financial mathematics, optimisation, risk management, Portfolio structuring Optimisation, quantitative economics, energy markets, econometrics Portfolio management Optimisation, electricity generation, gas logistics Portfolio optimisation Statistics, optimisation, econometrics, energy markets Energy markets and unknown factors Statistics, probabilities, load forecasts Load forecasting A common background: information technology
5 - -
Outline
1.
2.
3.
4.
Some Charateristics of electrical consumption in France Operational Model and his limits A range of methods for electrical consumption forecasting.
Future Work
6 - -
Some charateristics of electrical consumption in France
Electricity demand data
Various seasonal components.
High dependency on climate Other interspersed punctual events.
7 - -
Some characteristics of French electricity consumption
:
annual cycle
Daily Energy (Normal Temperature and nebulosity) from 2004/09/01to 2008/08/31 Special days and events: Christm as From 2006/09/01 to 2007/08/31 Mai Augu st
Some characteristics of French electricity consumption : temperature’s effect in winter
du 05 au 11 novembre 2007 du 12 au 18 novembre 2007
Some characteristics of French electricity consumption : special days and events
August, 15th Christmas January, 1st Montday to Friday Saturday Sunday 10
The economic crisis has an impact on French consumption on the period 2007-2009
Economic crisis 0 10 20 2002 2003 2004 2005 Instant 2006 2007 2008 30 40
A good quality of forecast
Problem studied during these last 25 years Forecasting horizon daily H=1:7*48 half an hour, one week, one year A current accuracy of 1. 2% (MAPE) for the forecast of the following day!
Why more?
Why more ?
Some very difficult periods (winter: with fast changes in temperature, special days and events: bank holidays, crisis, …) Due to the new competitive environment, forecasts must be more accurate.
Therefore customers are able to leave and join the company ( Non stationarity).
We have do to local forecast We have to take in account renewable energy, the evolving of uses We have to evaluate the uncertainties and provide forecast intervals Uncertainties about customer’s behavior , institutional mechanism, data acquisition, and socio-economic changes With keeping the good prediction performance of present methods despite the changing context (accuracy of 1.2% for the following day).
Why more ?
DATA Available data is changing :Split between provider and transporter Difficulties with measuring individual consumption Short past time series Different portfolios Different sampling New electricity meters : big data flows Renewable on the network
Portfolio to forecast varies
We have to forecast the consumption of EDF customers instead off french consumption Consumption’s process change and can becoming unstationnary Conso (MW) 80000 60000 50000 40000 30000 20000 10000 0 0 01/01 31/12 EDF EDF France France
EDF R&D : Créer de la valeur et préparer l’avenir - © EDF R&D 2011 19
Linky 35 millions of smarmeter
Customer’s behavior and use
Multimedia network ( now) Domotic network ( growing)
Concentrateur
Compteur Box
Client
Eau Chaude
Charges Production & Stockage VE/VEHR Telco Distributeur Fournisseur (d’énergie, de services)
EDF R&D : Créer de la valeur et préparer l’avenir - © EDF R&D 2011 20
Internet
Heating pump
La chaleur est prélevée sur l’air extérieur et est restituée sous forme:
d’eau chaude circulant dans les locaux (PAC air/eau) les émetteurs de chaleur, raccordés sur une boucle d’eau, peuvent être des planchers chauffants, un réseau de radiateurs et/ou de ventilo-convecteurs.
Plancher chauffant Radiateur Ventilo-convecteurs
d’air chaud envoyé dans les locaux (PAC air/air) les systèmes centralisés avec diffusion d’air chaud dans les faux plafonds par gaines ou par plenum. L’unité extérieure est reliée à un réseau d’air pulsé. Dans ce cas, le logement est équipé de bouches de soufflage et de grilles de reprise d’air.
A Key Issue: How Much Future Energy Efficiency?
Forecast demand can vary substantially from even a single variable, such as incremental energy efficiency, depending on the how it is incorporated.
5,500
SDG&E System Peak Load: Mid-Case Adjusted Forecast
Without Incremental Energy Efficiency 5,000 10% 4,500 With Incremental Energy Efficiency 4,000 3,500 Source: 2009 IEPR Forecast 3,000 19 96 19 97 19 98 19 99 20 00 20 01 20 02 20 03 20 04 20 05 20 06 20 07 20 08 20 09 20 10 20 11 20 12 20 13 20 14 20 15 20 16 20 17 20 18 20 19 20 20 Actual WN Actual Forecast without Uncommitted EE Total Adjusted Forecast
24 - -
Outline
•
The Operational Model and his limits
The operational model: model design
Outside temperatures in °C Non linear regression using S.G. Nash’s truncated Newton method Estimation based on several years with invalidated data: Breaking periods (summer holidays, Christmas holidays,…)
P
i
Wip
i
Wdp
i
SpeTar
i
i
The operational model: the weather independent part
For Hour h, of the Day d of the Year y :
h
,
y
,
day
_
type
(
d
) the load shape depending on the day type of d,
S h
the seasonality for h composed by Fourier series and dummy variables to cope with Daylight Savings 32000 27000 22000 17000 12000 0 DT1 4 DT2 8 DT3 12 DT4 16 DT5 20 DT6 24 28 DT7 32 DT8 36 DT9 40 DT10 44 DT11
Weather Independent Part
53000 48000 43000 38000 33000 1 31 61 91 121 151 181 211
Day of the year
241 271 301 331 361
Wip S h d h
,
d
,
y f h
f
h
d h
,
y
,
day q
_
h
,
type
(
d p
)
m
4 1
a m
,
h
cos
S h
(
d
) 2
md
365 .
25
b m
,
h
sin 2
md
365 .
25
The operational model: the weather dependent part
Using a temperature felt inside buildings as explanatory variable Weighted average of instantaneous temperatures and exponentially smoothed temperature Influence of cloud cover (lightning, green house effect)
STw h
,
d
,
y
( 1
h
).
T h
,
d
,
y
h
.
ST h
, ( 1 )
d
,
y
h
.( 8
CC h
,
d
,
y
)
ST h
, ( 1 )
d
,
y
( 1
h
).
T h
,
d
,
y
h
.
ST h
( 1 ) 1 ,
d
,
y
load/instant temperature load/temperature felt ins. build.
Historical and current ways of forecasting
The operational model: results
Estimation Results (real temperature) Middle-Term Forecast Results
[01/09/2000;31/08/2005] [ 01/09/2005;31/07/2006] RMSE
740 MW
MAPE
1.12%
AE
-3.6 MW
RMSE
817 MW
MAPE
1.16%
AE
27 MW
The operationnal model: comments
This is a sophisticated and efficient model. Take into account many aspects such as specific periods (Noel, 1 of may,…) Tricky to fit: a lot of parameters !
Parameters’ estimation by maximum likelihood Forecast Interval in progress using asymptotic parameters’ uncertainty and boostrap approaches.
For studies: simplified models such as GAM, Improvement: parsimony , adaptativity, optimization methods?
29
A range of methods for electrical consumption forecasting.
30
Solutions for adapting forecasting methods at EDF
Needs and possible solutions…
Adaptativity Move of customers Parsimony Kalman Kalmann Mixture of predictors Functional models GAM GAM KWF
Agregats
or
Global signal
uncertainties calculation Focus on Individual data and aggregate data a priori knowledge Forecast interval
1. State-space Models espace (adaptive Kalman) Derived from current models (METEHORE, ARMA) On line parameter identification (those of transfer function climatic data / consumption 2. Mixture of predictors Several predictors are used in parallel; Their optimal weighting in the mixture used for producing the final forecast is calculate by different algorithms.
3. GAM models ( Generalised Additif Models)
Non parametric approach: more flexibility, less a priori , «we let data speak for itself » Efficient algorithms: consumption estimated like a sum of function ( of lead consumption, temperature fitted by splines).
Well adapted for changes in the load curve An alternative to current parametrics models used currently A good tool for studies
4. Functional Models
Load curve is divided into functions These functions are projected in a wawelet basis. Similitudes between levels of decomposition are used for forecasting the functions.
Interest Each curve is an object instead of a time serie
Guaranteed temporal continuity Assurer la continuité temporelle
in particular for hourly forecasts and identify profils and their evolution Search forms of consumption at different frequencies (Kernel methods and wavelets decomposition).
Alternative to operational models with aim of simplification
Bayesian approach Methods allowed to combine two kinds of data: … and adapting forecasts with a priori information:
Prior impact: A prior distribution (given by the user) is combined with the data of a model to get a posterior distribution. Probabilistic predictions are derived naturally within the Bayesian workframe (credible intervals, HPD regions).
Poor historical data case: In case of too short historical data, usual forecasting methods are ineffective.
The use of a bayesian prior can improve the estimation of a model.
Some forecasting are needed on EDF portfolio subgroups. Some have very poor historical data. A prior can be built from a model previously estimated on a similar subgroup. One question is to know if the subgroups are « similar » enough.
A Kernel Wawelet Functional approach KWF
35
Some methods :models
KWF
Let (Z n ) be a stationarity functional valued process.
We predict next segment by Where (w n,m ) is a vector of weigths that increases when the segments
m
« similar ».
and
n
are wavelets MAPE %
The stationarity assumption is too strong for the power demand series .
Some methods :models
How do we compute the dissimilarity?
Monday WE After expanding two functions
l
,
m
compute for each scale
j
we Wavelets do discriminate We then aggregate the scales
We construct the prediction of wavelet scales using a kernel function.
KWF
Why stationarity fails? 2 keys: evolving mean level existance of groups KWF
MAPE
8,31% + Mean level correction 2,94% + Groups 1,64% Smooth approx.
Solutions:
of day m
Smooth approximation correction
Usage of groups of days Calendar information or unsupervised learning of classes
39 - -
Clusturing Functional Data Using wavelets
40
CWT and wavelet spectrum
41 - Different mid frequencies Wavelet spectrum of a electricity consumtion during Christmas day and a summer day.
Wavelet coherence:local correlation of the time-scale representation of 2 functions
Spectrum smoothed on time and scale Wavelet Extended R 2 coefficient Dissimilarity on a N time points and J scales.
42 - -
Wavelet spectrum of mean daily load Scales are in days
44 - -
Synthetic data – customers losses (StreamBase)
Mixture of experts
46
Some methods :models
Combining forecasts: an on-line forecasting problem
Individual predictors Adaptive Weights Combined forecast The challenge: reduce the cumulative loss of the combined
forecast, ideally lower than the best expert, or close to the best expert each time
Some methods :models
Basic ideas, basic algorithms
One parameter: A kind of « temperature » parameter AFTER algorithm: the higher the cumulative loss of a forecaster the lower the lower is its weight
Some methods :models French data, combination of opérationnal predictors
Different values of the weights depending on the « temperature » parameter
Some methods :models French data, combination of opérationnal predictors
Different values of the weights depending on the « temperature » parameter
Some methods :models French data set, operational predictors Performances
Gain : 140 MW EDF R&D – OSIRIS R39 –International Symposium of Forecasting 2010 1600 1400 1200 1000 800 600 400
RMSE mensuels 2008
PETRA (938 MW) PMIX_opé (924 MW) Gain de PMIX_dyn/PMIX_opé CPO (991 MW) PMIX_dyn (784 MW) 51 350 300 250 200 150 100 50 0 -50 -100
Some methods :models
Our development/improvements
Break detection and combining
New algorithms that are able to « follow » the best forecaster when it varies with time Goude, Y. (2008), "Tracking The Best Predictor With a Detection Based Algorithm", in JSM Proceedings, Denver, USA.
Goude, Y. (2009), "Adaptive Break Detection and Combining: Application to Electricity Load Forecast", in JSM Proceedings, Washington, USA.
Sleeping experts
We suppose that some experts can « sleep » during some period of time This setting suits particularly well with industrial applications New forecasters can appear with time During particularly hard to predict period of time (holidays, banking holidays …) some experts can’t produce any forecasts
Online estimation of the mixing parameters
M. Devaine (ENS Paris), Y. Goude (EDF R&D), G. Stoltz (HEC, CNRS), • Forecast of the electricity consumption by aggregation of specialized experts; application to Slovakian and French country wide hourly predictions‘ Submitted to JRSS –C
Adapted Predictors
Forecast Interval
EDF R&D – OSIRIS R39 –International Symposium of Forecasting 2010 52
Using Bayesian approach
53
-
Management of Uncertainty Forecast Intevals
Residuals Mixture of predictors Bayesian KWF Need to be valid and adapted to the problem 56 - -
RESIDUALS
57 - -
A confidence interval for the prediction on the KWF model
● The vector of weights w induces a probability distribution ● The KWF predictor is the mean of this empirical distribution ● The CI can be obtained (at least pointwisely) using the quantiles of this distribution ● The quantiles are estimated using bootstrap resampling.
● The sampling probabilities are given by w.
● Corrections need to be made in order to cope with the non stationarity of the functional time series.
Numerical results
● ● ● We test our approach on J+1 forescast during one year of the daily load curve at EDF.
For each day, we compute the mean coverage of the interval.
The empirical mean coverage are 89 %, 85 % and 80 % for the CI of announced level 95 %, 90 % and 80 % respectively.
Antoniadis, Brossat, Cugliari & Poggi
(2013)
preprint
hal.archives-ouvertes.fr/hal-00814530
60 - -
61 - -
Future Work
62
Future Work:
Prospective methods Consumption’s Changes: Abrupt changes (customers losses, unusual meteorological events): Evolving in the uses: heating pump, Sterling motors, battery, micro generation Demand respons services: pilot hot water heater,electric car recharge, contracted interruption Smartgrid Local forecast Renewable Global optimization vs local optimization Methods Changes Parsimonious models (simplicity allows adaptivity?) Regime switching Vectorial models Dimension reduction (factor models…) Clusturing, On line clusturing Multiscale -Mixt Models High dimension Reactive power forecast Non stationnary,locally stationnary processes, Simulation of the uses Mixture of experts with adapted experts
Future Work:
Probalistic forecasts
Do we need a new model for that (ex: quantile modelization)? Can we use our actual models?
Parametric vs nonparametric, time series bootstrap… Distinguishing and quantifying the different uncertainties from data to forecast: Adapted with needs of provider , of optimization Uncertainties on the consumption linked with physical uncertainties