TY - JOUR
T1 - Information theory, model error, and predictive skill of stochastic models for complex nonlinear systems
AU - Giannakis, Dimitrios
AU - Majda, Andrew J.
AU - Horenko, Illia
N1 - Funding Information:
This research of Andrew Majda is partially supported by NSF grant DMS-0456713 , by ONR DRI grants N25-74200-F6607 and N00014-10-1-0554 , and by DARPA grants N00014-07-10750 and N00014-08-1-1080 . Dimitrios Giannakis is supported as a postdoctoral fellow through the last three agencies. The authors wish to thank Paul Fischer for providing computational resources at Argonne National Laboratory. Much of this research was developed while the authors were participants in the long program at the Institute for Pure and Applied Mathematics (IPAM) on Hierarchies for Climate Science, which is supported by NSF, and in a recent month-long visit of DG and AJM to the University of Lugano.
PY - 2012/10/15
Y1 - 2012/10/15
N2 - Many problems in complex dynamical systems involve metastable regimes despite nearly Gaussian statistics with underlying dynamics that is very different from the more familiar flows of molecular dynamics. There is significant theoretical and applied interest in developing systematic coarse-grained descriptions of the dynamics, as well as assessing their skill for both short- and long-range prediction. Clustering algorithms, combined with finite-state processes for the regime transitions, are a natural way to build such models objectively from data generated by either the true model or an imperfect model. The main theme of this paper is the development of new practical criteria to assess the predictability of regimes and the predictive skill of such coarse-grained approximations through empirical information theory in stationary and periodically-forced environments. These criteria are tested on instructive idealized stochastic models utilizing K-means clustering in conjunction with running-average smoothing of the training and initial data for forecasts. A perspective on these clustering algorithms is explored here with independent interest, where improvement in the information content of finite-state partitions of phase space is a natural outcome of low-pass filtering through running averages. In applications with time-periodic equilibrium statistics, recently developed finite-element, bounded-variation algorithms for nonstationary autoregressive models are shown to substantially improve predictive skill beyond standard autoregressive models.
AB - Many problems in complex dynamical systems involve metastable regimes despite nearly Gaussian statistics with underlying dynamics that is very different from the more familiar flows of molecular dynamics. There is significant theoretical and applied interest in developing systematic coarse-grained descriptions of the dynamics, as well as assessing their skill for both short- and long-range prediction. Clustering algorithms, combined with finite-state processes for the regime transitions, are a natural way to build such models objectively from data generated by either the true model or an imperfect model. The main theme of this paper is the development of new practical criteria to assess the predictability of regimes and the predictive skill of such coarse-grained approximations through empirical information theory in stationary and periodically-forced environments. These criteria are tested on instructive idealized stochastic models utilizing K-means clustering in conjunction with running-average smoothing of the training and initial data for forecasts. A perspective on these clustering algorithms is explored here with independent interest, where improvement in the information content of finite-state partitions of phase space is a natural outcome of low-pass filtering through running averages. In applications with time-periodic equilibrium statistics, recently developed finite-element, bounded-variation algorithms for nonstationary autoregressive models are shown to substantially improve predictive skill beyond standard autoregressive models.
KW - Autoregressive models
KW - Clustering algorithms
KW - Information theory
KW - Model error
KW - Predictability
KW - Stochastic models
UR - http://www.scopus.com/inward/record.url?scp=84865980981&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84865980981&partnerID=8YFLogxK
U2 - 10.1016/j.physd.2012.07.005
DO - 10.1016/j.physd.2012.07.005
M3 - Article
AN - SCOPUS:84865980981
SN - 0167-2789
VL - 241
SP - 1735
EP - 1752
JO - Physica D: Nonlinear Phenomena
JF - Physica D: Nonlinear Phenomena
IS - 20
ER -