Examples of Four-Dimensional Data Assimilation in Oceanography Ibrahim Hoteit University of Maryland October 3, 2007
Download ReportTranscript Examples of Four-Dimensional Data Assimilation in Oceanography Ibrahim Hoteit University of Maryland October 3, 2007
Examples of Four-Dimensional Data Assimilation in Oceanography Ibrahim Hoteit University of Maryland October 3, 2007 Outline 4D Data Assimilation 4D-VAR and Kalman Filtering Application to Oceanography Examples in Oceanography 4D-VAR Assimilation Tropical Pacific, San Diego, … Filtering Methods Mediterranean Sea, Coupled models, Nonlinear filtering ... Discussion and New Applications 1 Data Assimilation Goal: Estimate the state of a dynamical system Information: Imperfect dynamical Model: xk 1 M k 1,k ( xk ) k 1 x state vector, N(0,Q) model error M k 1,k transition operator form k to k 1 Sparse observations: yko Hk ( xk ) k y observation vector, N(0, R) observation error H k observational operator A priori Knowledge: xb and its uncertainties B o 2 Data Assimilation Data assimilation: Use all available information to determine the best possible estimate of the system state Observations show the real trajectory to the model Model dynamically interpolates the observations a 3D assimilation: Determine an estimate x of the state at a given time given an observation y o by minimizing T J ( x) y Hx R 1 y o Hx xb x B1 xb x o 4D assimilation: Determine x1a , , xNa given y1o , 4D-VAR and Kalman Filtering , yNo 3 4D-VAR Approach Optimal Control: Look for the model trajectory that best fits the observations by adjusting a set of “control variables” minimize J (c) k 1 y H k xk R 1 yko H k xk cb c B1 cb c N o k T with the model as constraint: xk 1 M k 1,k ( xk ) k 1 c is the control vector and may include any model parameter (IC, OB, bulk coefficients, etc) … and model errors Use a gradient descent algorithm to minimize J Most efficient way to compute the gradients is to run the adjoint model backward in time 4 Kalman Filtering Approach Bayesian estimation: Determine pdf of xk given y1:ok y1o , , yko Px k y1o:k Minimum Variance (MV) estimate (minimum error on average) xka E xk / y1:ok Maximum a posteriori (MAP) estimate (most likely) xka Arg max Px k y1:ok Kalman filter (KF): provides the MV (and MAP) estimate for linearGaussian Systems 5 The Kalman Filter (KF) Algorithm Initialization Step: x0 and P0 Analysis Step (observation) Forecast Step (model) Forecast state xkf M k ,k 1xka1 Forecast Error covariance Pkf Mk ,k 1 Pka1 Mkk1 Qk Kalman Gain Gk Pkf HkT [ Hk Pkf HkT R k ]1 Analysis state xka xkf Gk [ yko Hk xkf ] Analysis Error covariance Pka Pkf Gk Hkf Pkf 6 Application to Oceanography 4D-VAR and the Kalman filter lead to the same estimate at the end of the assimilation window when the system is linear, Gaussian and perfect Nonlinear system: 4D-VAR cost function is non-convex multiple minima Linearize the system suboptimal Extended KF (EKF) System dimension ~ 108: 4D-VAR control vector is huge KF error covariance matrices are prohibitive Errors statistics: Poorly known R, Q, B Non-Gaussian: KF is still the MV among linear estimators 7 4D Variational Assimilation ECCO 1o Global Assimilation System Eddy-Permitting 4D-VAR Assimilation ECCO Assimilation Efforts at SIO Tropical Pacific, San Diego, … In collaboration with the ECCO group, especially Armin Köhl*, Detlef Stammer*, Patrick Heimbach** *Universitat Hamburg/Germany, **MIT/USA 8 ECCO 1o Global Assimilation System Model: Data: MITGCM (TAF-compiler enabled) NCEP forcing and Levitus initial conditions Altimetry (daily): SLA TOPEX, ERS SST (monthly): TMI and Reynolds Profiles (monthly) : XBTs, TAO, Drifters, SSS, ... Climatology (Levitus S/T) and Geoid (Grace mission) Assimilation scheme: 4D-VAR with control of the initial conditions and the atmospheric forcing (with diagonal weights!!!) ECCO reanalysis: 1o global ocean state and atmospheric forcing from 1992 to 2004, …and from 1952 2001 (Stammer et al. …) 9 ECCO Solution Fit Equatorial Under Current (EUC) Johnson ECCO 10 ECCO Tropical Pacific Configuration Regional: 26oS 26oN, 1/3o, 50 layers, ECCO O.B. Data: TOPEX, TMI SST, TAO, XBT, CTD, ARGO, Drifters; all at roughly daily frequency MITGCM Tropical Climatology: Levitus-T and S,Pacific Reynolds SST and GRACE Control: Initial conditions , 2-daily forcing, and weekly O.B. Smoothing: Smooth ctrl fields using Laplacian in the horizontal and first derivatives in the vertical and in time First guess: Levitus (I.C.), NCEP (forcing), ECCO (O.B.) OB = (U,V,S,T) 11 Eddy-Permitting 4D-VAR Assimilation The variables of the adjoint model exponentially increase in time Typical behavior for the adjoint of a nonlinear chaotic model Indicate unpredictable events and multiple local minima Correct gradients but wrong sensitivities Invalidate the use of a gradient-based optimization algorithm Assimilate over short periods (2 months) where the adjoint is stable Replace the original unstable adjoint with the adjoint of a tangent linear model which has been modified to be stable (Köhl et al., Tellus-2002) Exponentially increasing gradients were filtered out using larger viscosity and diffusivity terms in the adjoint model 12 HFL gradients after 45 days with increasing viscosities Visc = 1e11 & Diff = 4e2 20*Visc & 20*Diff 10*Visc & 10*Diff 30*Visc & 30*Diff 13 Initial temperature gradients after 1 year (2000) 10*Visc & 10*Diff 20*Visc & 20*Diff 14 Data Cost Function Terms 1/3;39 1/6;39 1;39 1;23 15 Control Cost Function Terms 1/3;39 1/6;39 1;39 1;23 16 Fit to Data 1/3;39 1/6;39 1;39 1;23 17 Assimilation Solution (weekly field end of August) 18 What Next … Fit is quite good and assimilation solution is reasonable Extend assimilation period over several years Add new controls to enhance the controllability of the system and reduce errors in the controls Improve control constraints … Some references Hoteit et al. (QJRMS-2006) Hoteit et al. (JAOT-2007) Hoteit et al. (???-2007) 19 Other MITGCM Assimilation Efforts at SIO 1/10o CalCOFI 4D-VAR assimilation system Predicting the loop current in the Gulf of Mexico … San Diego high frequency CODAR assimilation ● Assimilate hourly HF radar data and other data Adjoint effectiveness at small scale Information content of surface velocity data ● MITGCM with 1km resolution and 40 layers ● Control: I.C., hourly forcing and O.B. ● First guess: one profile T, S and TAU (no U, V, S/H-FLUX) ● Preliminary results: 1 week, no tides 20 Model Domain and Radars Coverage Time evolution of the normalized radar cost 1/3;39 1/6;39 1;39 1;23 21 Assimilation Solution: SSH / (U,V) & Wind Adj. 1/3;39 1/6;39 1;39 1;23 22 What Next … Assimilation over longer periods Include tidal forcing Coupling with atmospheric model Nesting into the CalCOFI model 23 Filtering Methods Low-Rank Extended/Ensemble Kalman Filtering SEEK/SEIK Filters Application to Mediterranean Sea Kalman Filtering for Coupled Models Particle Kalman Filtering In collaboration with D.-T. Pham*, G. Triantafyllou**, G. Korres** *CNRS/France, **HCMR/Greece 24 Low-rank Extended/Ensemble Kalman Filtering Reduced-order Kalman filters: Project x on a low-dim subspace xk Lxk Pk L Pk L T x (n); x (r ) L (n, r ); P ( r , r ) Kalman correction along the directions of L Reduced error subspace Kalman filters: Pk has low-rank Pk Lk Pk Lk T Ensemble Kalman filters: Monte Carlo approach to Pk 1 N i i i i T ( x x )( x x i1 k k k k ) N Correction along the directions of [ x1k xki x1k xkN ] 25 Singular Evolutive Kalman (SEEK) Filters Low-rank (r) error subspace Kalman filters: x0 ,P0 L0U0 LT0 Analysis Forecast x M k (x f k a k 1 Lk M k Lk 1 ) 1 k U U 1 k 1 Hk Lk R Hk Lk 1 i T x x LkU k Hk Lk R k1[ yko H k ( xkf )] a k f k T A “collection” of SEEK filters: SEEK: Extended variant M Μ SFEK: Fixed variant M Id L L0 SEIK: Ensemble variant with (r+1) members only! (~ETKF) Inflation and Localization 26 The Work Package WP12 in MFSTEP EU project between several European institutes Assimilate physical & biological observations into coupled ecosystem models of the Mediterranean Sea: Develop coupled physical-biological model for regional and coastal areas of the Mediterranean Sea Implement Kalman filtering techniques with the physical and biological model … Investigate the capacity of surface observations (SSH, CHL) to improve the behavior of the coupled system 27 The Coupled POM – BFM Model One way coupled: Ecology does not affect the physics x p n, u, v, u, v, T, S, q2 , q2l O2o,O3o, N1p, N3n, N5s, N4n, e x P1,2,3,4 (c,n,p,s,i),Z4,5,6 (c,n,p),R 6(c,n,p,s), R1(c,n,p),B1(c,n,p),Q1,6,7 (c,n,p,s) 28 A Model Snapshot 1/10o Eastern Mediterranean configuration 25 layers Elevation and Mean Velocity Mean CHL integrated 1-120m 29 Assimilation into POM Model = 1/10o Mediterranean configuration with 25 layers Observations = Altimetry, SST, Profiles T & S profiles, Argo data, and XBTs on a weekly basis SEIK Filter with rank 50 (51 members) Initialization = EOFs computed from 3-days outputs of a 3-year model integration Inflation factor = 0.5 Localization = 400 Km 30 Assimilation into POM Mean Free-run RMS Error SSH RMS Misfits Mean Forecast RMS Error Free-Run Forecast Obs Error = 3cm Mean Analysis RMS Error Analysis 31 Salinity RMS Error Time Series FerryBox data at Rhone River SSH 07/12/2005 07/12/05: SATELLITE SSH FREE RUN FORECAST ANALYSIS 6/11/2015 ECOOP KICK-OFF 32 Assimilation into BFM Model = 1/10o Eastern Mediterranean with 25 layers with perfect physics Observations = SeaWiFS CHL every 8 days in 1999 SFEK Filter = SEEK with invariant correction subspace Correction subspace = 25 EOFs computed from 2-days outputs of a one year model integration Inflation factor = 0.3 Localization = 200 Km 33 Assimilation into BFM CHL RMS Misfits Forecast Free-Run Analysis 34 CHL Cross-Section at 34oN Ph Cross-Section at 28oE 35 Kalman Filtering for Coupled Models Physical System p p p p xk M k ( xk 1 ) k p p p p y H ( x ) k k k k Ecological System e e e p e x M ( x , x ) k k k 1 k k e e e e yk H k ( xk ) k MAP: Direct maximization of the joint conditional density xp e Max Px p xe y p ye x standard Kalman filter estimation problem Joint approach: strong coupling and same filter (rank) !!! Dual Approach Decompose the joint density into marginal densities Px p xe y p ye Pxe P x p y p ye x p y p ye Compute MAP estimators from each marginal density x p Arg max Px p y p ye & x e Arg max Pxe x p y p ye Separate optimization leads to two Kalman filters … Different degrees of simplification and ranks for each filter significant cost reduction Same e x from the joint or the dual approach p e The physical filter assimilate y and y RRMS for state vectors Twin-Experiments 1/10o Eastern Mediterranean (25 layers) Joint: SEEK rank-50 RRMS . X REF X FILTER Dual: SEEK rank-50 for physics SFEK rank-20 for biology X REF X Physics Biology Ref Dual Joint Ref Dual Joint 38 What Next … Joint/Dual Kalman filtering with real data State/Parameter Kalman estimation Better account for model errors Some references Hoteit et al. (JMS-2003), Triantafyllou et al. (JMS-2004), Hoteit et al. (NPG-2005), Hoteit et al. (AG-2005), (Hoteit et al., 2006), Korres et al. (OS-2007) 39 Nonlinear Filtering - Motivations The EnKF is “semi-optimal”; it is analysis step is linear The optimal solution can be obtained from the optimal nonlinear filter which provides the state pdf given previous data Particle filter (PF) approximates the state pdf by mixture of Dirac N functions i 1 wi x , but suffers from the collapse (degeneracy) of its particles (analysis step only update the weights wi ) i Surprisingly, recent results suggest that the EnKF is more stable than the PF for small ensembles because the Kalman correction attenuates the collapse of the ensemble 40 The Particle Kalman Filter (PKF) The PKF uses a Kernel estimator to approximate the pdfs of the nonlinear filter by a mixture of Gaussian densities Px / y 1:k 1 i1 wki k 1 G x xki k 1; ik k 1 N The state pdfs can be always approximated by mixture of Gaussian densities of the same form: Analysis Step: Kalman-type: EKF analysis to update x i and Pi Particle-type: weight update (but using instead of R ) i Forecast Step: EKF forecast step to propagate x and P i Resampling Step: … 41 Particle Kalman Filtering in Oceanography It is an ensemble of extended Kalman filters with weights!! Particle Kalman Filtering requires simplification of the particles error covariance matrices The EnKF can be derived as a simplified PKF Hoteit et al. (MWR-2007) successfully tested one low-rank PKF with twin experiments What Next … Derive and test several simplified variants of the PKF Assess the relevance of a nonlinear analysis step: comparison with the EnKF Assimilation of real data … 42 Discussion and New Applications Advanced 4D data assimilation methods can be now applied to complex oceanic and atmospheric problems More work is still needed for the estimation of the error covariance matrices, the assimilation into coupled models, and the implementation of the optimal nonlinear filter New Applications: ENSO prediction using neural models and Kalman filters Hurricane reconstruction using 4D-VAR ocean assimilation! Ensemble sensitivities and 4D-VAR Optimization of Gliders trajectories in the Gulf of Mexico … THANK YOU 43 44 4D-VAR or (Ensemble) Kalman Filter? 4D-VAR Easier to understand EnKF Easier to incorporate a complex background covariance matrix More portable (easier to implement?) Support different degrees of simplifications Low-rank estimates of the error cov. matrices (better forecast!) Dynamically consistent solution Still room for improvement … No low-rank deficiency 4D-VAR or EnkF? … 45 Sensitivity to first guess (25 Iterations) 46 Comparison with TAO-Array RMS RMS Zonal Velocity (m/s) 1/3;39 1/6;39 RMS Meridional Velocity (m/s) 1;39 1;23 47 San Diego HF Radar Currents Assimilation Assimilate hourly HF radar data and other data Goals: Adjoint effectiveness at small scale Information content of surface velocity data Dispersion of larvae, nutrients, and pollutants MITGCM with 1km resolution with 40 layers Control: I.C., hourly forcing, and O.B. First guess: one profile, no U and V, and no forcing! Preliminary results: 1 week, no tides 48 Cost Function terms 1/3;39 1/6;39 1;39 1;23 49 Assimilation Solution: U & V 1/3;39 1/6;39 1;39 1;23 50 Assimilation Solution 1/3;39 1/6;39 1;39 1;23 51 Gulf of Mexico Loop current prediction Observations HF radar, Gliders, ADCP, … Adjoint effectiveness … 1/10o with 50 layers Ctrl: I.C. (S,T), daily forcing, and weekly O.B. Proof of concept assimilating SSH, Levitus and Reynolds … 52 What Next … Ensemble forecasting Ensemble Kalman filtering Optimization of observation systems 53 Twin-Experiments Setup Model = 1/10o Eastern Mediterranean with 25 layers 1996 2000 Spin up 4 years 05/03/02 EOFs 2 years 05/03/02 REF – OBS 3 months Pseudo-obs: SSH and CHL surface data every 3 days Initialization: start from mean state of the 2 years run Free-run: run without assimilation starting from mean state Evaluation: RMS misfit relative to the misfit from mean state RRMS X REF X FILTER X REF X 54 Low-rank Extended/Ensemble Kalman Filtering EnKF a ,i a low-dim f ,i f ,i Reduced-order Kalmanf ,ifilters: subspace ● Project xxon ● x G [ y Hx xk k k k ] ● ● x T(r ) f T 1 x (Pnf);H T xka,i1 ● ● xModel G [H P H R ] ● Sxk ● ●Pk S●Pk S k ● ● ● ● a n, r ); xPa ,(i r), r ) ● ● S(cov( ● P ●● ● ● ●● k ● Data ● ● ● ● ● of S ● ● ●Analysis along the directions ● ● SPF ● f f ,i a f f ,i x x ● Reduced k k error subspace Kalman filters: has low-rank x x G [ y Hx ● k Pk k k ] f f ,i f T f T 1 P cov( x ) T G P H [H P H R ] Pk S P Sk k k k k Resampling Pa P f G H P f Ensemble Kalman filters: Monte Carlo approach to Pk Pk 1 N i i i i T ( x x )( x x i1 k k k k ) N Analysis along the directions of [ x1k xki x1k xkN ] 55 Low-Rank Deficiency Issues Error covariance matrices are underestimated Few degrees of freedom to fit the data Amplification by an inflation factor P P f f Localization of the covariance matrix (using Schur product) 56 Joint Approach Direct maximization of the joint conditional density standard Kalman filter estimation problem acting on xkp M kp ( xkp1 ) kp xk e e e e x M ( x ) k k k 1 k and assimilating ykp H kp ( xkp ) kp yk e e e e y H (x ) k k k k Issues: Strong coupling and same filter (rank) 57 Dual Approach – Some Facts e Only the second marginal density depends on x , this means same x e from the joint or the dual approach p e x does not depend on x , more in line with the one-way coupling of the system p e The physical filter assimilates both y and y : assimilation of y e guaranties consistency between the two subsystems e The ecological filter assimilates only y , but it is forced with the solution of the physical filter The linearization of the observational operator in the physical filter is a complex operation because of the dependency of the ecology on x p, it was neglected in this preliminary application Twin-Experiments Setup Model = 1/10o Eastern Mediterranean with 25 layers 1996 2000 Spin up 4 years 05/03/02 EOFs 2 years 05/03/02 REF – OBS 3 months Pseudo-obs: SSH and CHL surface data every 3 days Initialization: start from mean state of the 2 years run Free-run: run without assimilation starting from mean state Evaluation: RMS misfit relative to the misfit from mean state RRMS X REF X FILTER X REF X 59 The Optimal Nonlinear Filter As the Kalman filter, it operates as a succession of forecast and analysis steps to update the state pdf: Forecast Step: Integrate the analysis pdf with the model pk k 1 ( x | y1:k 1 ) n x M k (u ); Qk pk 1 (u | y1:k 1 )du Analysis Step: Correct the predictive pdf with the new data pk ( x | y1:k ) 1 bk pk k 1 ( x | y1:k 1 ) yk H k ( x); Rk Particle Filter approximates the state pdf by mixture of Dirac functions N wi x , but suffers from degeneracy. i 1 i 60 New Directions/Applications New Applications ENSO prediction using surrogate models and Kalman filters Hurricane reconstruction using 4DVAR ocean assimilation! Ensemble sensitivities and 4DVAR Other Interests Optimal Observations Estimate Model and Observational Errors Estimate Background Covariance Matrices in 4DVAR Study the behavior of the different 4DVAR methods with highly nonlinear models 61