Calibrating Bayesian UNet++ for Sub-Seasonal Forecasting
Busra Asan, Abdullah Akgül, Alper Unal, Melih Kandemir, Gozde Unal
TL;DR
The paper addresses the challenge of producing reliable, calibrated sub-seasonal temperature forecasts. It extends UNet++ to a Bayesian CNN by treating the final layers as Bayesian with weights $\theta \sim \mathcal{N}$ and optimizes the $ELBO$ to yield predictive distributions, then couples this with a CDF-based calibration using an isotonic regressor $R: [0,1] \to [0,1]$ so that $R \circ F_t$ reflects empirical frequencies. Key findings show that well-calibrated Bayesian forecasts achieve higher sharpness and more accurate coverage than MC-Dropout or Deep Ensemble baselines, albeit with a trade-off in traditional point-error metrics like MAE. The approach leverages CMIP6 for training and ERA5 for fine-tuning, delivering calibrated and sharper probabilistic forecasts that can be generalized to other climate variables and safety-critical forecasting tasks.
Abstract
Seasonal forecasting is a crucial task when it comes to detecting the extreme heat and colds that occur due to climate change. Confidence in the predictions should be reliable since a small increase in the temperatures in a year has a big impact on the world. Calibration of the neural networks provides a way to ensure our confidence in the predictions. However, calibrating regression models is an under-researched topic, especially in forecasters. We calibrate a UNet++ based architecture, which was shown to outperform physics-based models in temperature anomalies. We show that with a slight trade-off between prediction error and calibration error, it is possible to get more reliable and sharper forecasts. We believe that calibration should be an important part of safety-critical machine learning applications such as weather forecasters.
