Valid Error Bars for Neural Weather Models using Conformal Prediction
Vignesh Gopakumar, Joel Oskarrson, Ander Gray, Lorenzo Zanisi, Stanislas Pamela, Daniel Giles, Matt Kusner, Marc Deisenroth
TL;DR
The paper tackles the lack of uncertainty quantification in neural weather forecasts by introducing an inductive conformal prediction (CP) post-processing framework that yields calibrated prediction sets with coverage $1-\alpha$ for every spatio-temporal point, without modifying the underlying model. It extends CP to the spatio-temporal domain, calibrating per-cell predictions using non-conformity scores (RES for deterministic outputs and STD for probabilistic Gaussian outputs) and estimating a quantile $\hat{q}$ to form the prediction intervals. The method is demonstrated on Hi-LAM, a limited-area Nordic neural weather model, using both a deterministic MSE and a probabilistic NLL variant, achieving empirical coverage around $91\%$ at the nominal $90\%$ level and showing tighter bounds for STD than RES. Key contributions include per-cell spatio-temporal calibration, a formal non-conformity score framework, and a practical, low-cost approach to obtaining reliable uncertainty bounds that complement ensemble methods.
Abstract
Neural weather models have shown immense potential as inexpensive and accurate alternatives to physics-based models. However, most models trained to perform weather forecasting do not quantify the uncertainty associated with their forecasts. This limits the trust in the model and the usefulness of the forecasts. In this work we construct and formalise a conformal prediction framework as a post-processing method for estimating this uncertainty. The method is model-agnostic and gives calibrated error bounds for all variables, lead times and spatial locations. No modifications are required to the model and the computational cost is negligible compared to model training. We demonstrate the usefulness of the conformal prediction framework on a limited area neural weather model for the Nordic region. We further explore the advantages of the framework for deterministic and probabilistic models.
