Towards Physically Consistent Deep Learning For Climate Model Parameterizations
Birgit Kühbacher, Fernando Iglesias-Suarez, Niki Kilbertus, Veronika Eyring
TL;DR
This work tackles the challenge of physically inconsistent and uninterpretable DL-based climate model parameterizations by introducing PCMasking, a data-driven framework that first uncovers physical input drivers through a sparsity-promoting pre-masking phase and then enforces physical consistency via a thresholded input mask during a masking fine-tuning phase. The method maintains predictive performance comparable to causally-informed baselines while substantially reducing computational overhead, as demonstrated on SPCAM aquaplanet data with one model per output variable. SHAP analyses show that PCMasking suppresses non-physical, long-range input–output links and concentrates on plausible local drivers, improving interpretability. Cross-climate experiments indicate that the framework identifies robust physical drivers across climates, though generalization remains an area for future improvement. Overall, PCMasking advances data-driven climate parameterizations by combining physical driver selection with automatic, architecture-friendly training, offering practical benefits for scalable ensemble forecasting and deeper physical insight.
Abstract
Climate models play a critical role in understanding and projecting climate change. Due to their complexity, their horizontal resolution of about 40-100 km remains too coarse to resolve processes such as clouds and convection, which need to be approximated via parameterizations. These parameterizations are a major source of systematic errors and large uncertainties in climate projections. Deep learning (DL)-based parameterizations, trained on data from computationally expensive short, high-resolution simulations, have shown great promise for improving climate models in that regard. However, their lack of interpretability and tendency to learn spurious non-physical correlations result in reduced trust in the climate simulation. We propose an efficient supervised learning framework for DL-based parameterizations that leads to physically consistent models with improved interpretability and negligible computational overhead compared to standard supervised training. First, key features determining the target physical processes are uncovered. Subsequently, the neural network is fine-tuned using only those relevant features. We show empirically that our method robustly identifies a small subset of the inputs as actual physical drivers, therefore removing spurious non-physical relationships. This results in by design physically consistent and interpretable neural networks while maintaining the predictive performance of unconstrained black-box DL-based parameterizations.
