On the optimal prediction of extreme events in heavy-tailed time series with applications to solar flare forecasting
Victor Verma, Stilian Stoev, Yang Chen
TL;DR
This work derives a Neyman–Pearson-type framework for optimal prediction of extreme events in heavy-tailed time series, showing that calibrated optimal predictors are governed by density ratios $r(X)=f_1(X)/f_0(X)$ and calibrations $F_{r(X)}^{\leftarrow}(q)$. It develops explicit closed-form predictors for additive and linear models, extends the theory to autoregressive and infinite-variance moving-average processes, and provides asymptotic results linking extremal precision to tail dependence coefficients. For practical inference in AR$(d)$ models, a plug-in approach based on robust coefficient estimation yields asymptotically calibrated and optimal predictors under mild regularity conditions; the asymptotic extremal-precision results are derived for MA$(\infty)$ via regular variation theory. The methodology is illustrated with solar-flare forecasting using GOES soft X-ray flux data, comparing baseline, AR, and FARIMA models to quantify fundamental limits and guide operational risk forecasting in a real-world, long-memory, heavy-tailed setting. Overall, the paper formalizes fundamental limits on predicting extremes in heavy-tailed time series and demonstrates how to approach such predictions in practice, with concrete implications for solar flare forecasting and related extreme-event tasks.
Abstract
The prediction of extreme events in time series is a fundamental problem arising in many financial, scientific, engineering, and other applications. We begin by establishing a general Neyman-Pearson-type characterization of optimal extreme event predictors in terms of density ratios. This yields new insights and several closed-form optimal extreme event predictors for additive models. These results naturally extend to time series, where we study optimal extreme event prediction for both light- and heavy-tailed autoregressive and moving average models. Using a uniform law of large numbers for ergodic time series, we establish the asymptotic optimality of an empirical version of the optimal predictor for autoregressive models. Using multivariate regular variation, we obtain an expression for the optimal extremal precision in heavy-tailed infinite moving averages, which provides theoretical bounds on the ability to predict extremes in this general class of models. We address the important problem of predicting solar flares by applying our theory and methodology to a state-of-the-art time series consisting of solar soft X-ray flux measurements. Our results demonstrate the success and limitations in solar flare forecasting of long-memory autoregressive models and long-range-dependent, heavy-tailed FARIMA models.
