Information-theoretic analysis of temporal dependence in discrete stochastic processes: Application to precipitation predictability

Juan De Gregorio; David Sánchez; Raúl Toral

Information-theoretic analysis of temporal dependence in discrete stochastic processes: Application to precipitation predictability

Juan De Gregorio, David Sánchez, Raúl Toral

TL;DR

This work develops an information-theoretic framework to quantify temporal memory in discrete stochastic processes via the predictability gain $\mathcal{G}_u$, derived from block entropies $H_r$, and links it to the entropy rate $h$ through $G_T=H_1-h$. It introduces a bootstrap-based hypothesis-testing procedure and Fisher’s method to robustly estimate the memory order $\hat{m}^{\text{PG}}$ from finite data, outperforming AIC and BIC in simulations. Applied to daily precipitation records across the contiguous United States, the method reveals a dominance of low-order Markov memory ($m\in\{0,1\}$) with pronounced seasonal and regional variation (e.g., stronger West Coast winter correlations, stronger Southeast summer correlations). The resulting framework provides a transparent, data-driven approach for memory-aware stochastic modeling and real-time forecasting in spatially heterogeneous systems, with potential extension to other domains exhibiting short-term temporal dependencies.

Abstract

Understanding the temporal dependence of precipitation is key to improving weather predictability and developing efficient stochastic rainfall models. We introduce an information-theoretic approach to quantify memory effects in discrete stochastic processes and apply it to daily precipitation records across the contiguous United States. The method is based on the predictability gain, a quantity derived from block entropy that measures the additional information provided by higher-order temporal dependencies. This statistic, combined with a bootstrap-based hypothesis testing and Fisher's method, enables a robust memory estimator from finite data. Tests with generated sequences show that this estimator outperforms other model-selection criteria such as AIC and BIC. Applied to precipitation data, the analysis reveals that daily rainfall occurrence is well described by low-order Markov chains, exhibiting regional and seasonal variations, with stronger correlations in winter along the West Coast and in summer in the Southeast, consistent with known climatological patterns. Overall, our findings establish a framework for building parsimonious stochastic descriptions, useful when addressing spatial heterogeneity in the memory structure of precipitation dynamics, and support further advances in real-time, data-driven forecasting schemes.

Information-theoretic analysis of temporal dependence in discrete stochastic processes: Application to precipitation predictability

TL;DR

Abstract

Information-theoretic analysis of temporal dependence in discrete stochastic processes: Application to precipitation predictability

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (9)

Theorems & Definitions (6)