Bridging the Gap Between Bayesian Deep Learning and Ensemble Weather Forecasts
Xinlei Xiong, Wenbo Hu, Shuxun Zhou, Kaifeng Bi, Lingxi Xie, Ying Liu, Richang Hong, Qi Tian
TL;DR
This work targets uncertainty quantification in weather forecasting by bridging Bayesian deep learning (BDL) with traditional ensemble prediction systems. It introduces a hybrid framework that simultaneously models epistemic uncertainty via variational inference on model parameters and aleatoric uncertainty through a physics-informed, flow-dependent perturbation of the atmospheric state using a spherical-harmonic random field. The approach is underpinned by theoretical results that decompose total predictive uncertainty into its epistemic and aleatoric components and validated on the ERA5 reanalysis dataset, showing improved probabilistic skill, better calibration, and substantial computational efficiency over diffusion-based baselines. The combination yields a practical, scalable method for trustworthy AI-based weather prediction with an actionable open-source direction for operational use.
Abstract
Weather forecasting is fundamentally challenged by the chaotic nature of the atmosphere, necessitating probabilistic approaches to quantify uncertainty. While traditional ensemble prediction (EPS) addresses this through computationally intensive simulations, recent advances in Bayesian Deep Learning (BDL) offer a promising but often disconnected alternative. We bridge these paradigms through a unified hybrid Bayesian Deep Learning framework for ensemble weather forecasting that explicitly decomposes predictive uncertainty into epistemic and aleatoric components, learned via variational inference and a physics-informed stochastic perturbation scheme modeling flow-dependent atmospheric dynamics, respectively. We further establish a unified theoretical framework that rigorously connects BDL and EPS, providing formal theorems that decompose total predictive uncertainty into epistemic and aleatoric components under the hybrid BDL framework. We validate our framework on the large-scale 40-year ERA5 reanalysis dataset (1979-2019) with 0.25° spatial resolution. Experimental results show that our method not only improves forecast accuracy and yields better-calibrated uncertainty quantification but also achieves superior computational efficiency compared to state-of-the-art probabilistic diffusion models. We commit to making our code open-source upon acceptance of this paper.
