Beyond the Training Data: Confidence-Guided Mixing of Parameterizations in a Hybrid AI-Climate Model
Helge Heuer, Tom Beucler, Mierk Schwabe, Julien Savre, Manuel Schlund, Veronika Eyring
TL;DR
This work tackles the instability of hybrid AI–climate models by transferring a ClimSim-trained BiLSTM convection parameterization to the ICON-A model and making its use tunable via confidence-guided mixing with a conventional convection scheme. The authors implement a physics-informed, uncertainty-aware loss and add pretraining-time input noise to improve long-term stability, enabling stable year-long and 20-year AMIP-style simulations. Key contributions include a rigorous data preprocessing step to isolate convection, a two-head network that estimates predictive uncertainty, and a mixing strategy that reduces extrapolation risk while remaining interpretable across environmental regimes. The findings show that the mixed, physics-informed approach can outperform the baseline Tiedtke parameterization in several observational benchmarks, while preserving conservation and achieving robust long-term stability, with practical implications for deploying ML-enhanced parameterizations in operationally relevant climate models.
Abstract
Persistent systematic errors in Earth system models (ESMs) arise from difficulties in representing the full diversity of subgrid, multiscale atmospheric convection and turbulence. Machine learning (ML) parameterizations trained on short high-resolution simulations show strong potential to reduce these errors. However, stable long-term atmospheric simulations with hybrid (physics + ML) ESMs remain difficult, as neural networks (NNs) trained offline often destabilize online runs. Training convection parameterizations directly on coarse-grained data is challenging, notably because scales cannot be cleanly separated. This issue is mitigated using data from superparameterized simulations, which provide clearer scale separation. Yet, transferring a parameterization from one ESM to another remains difficult due to distribution shifts that induce large inference errors. Here, we present a proof-of-concept where a ClimSim-trained, physics-informed NN convection parameterization is successfully transferred to ICON-A. The scheme is (a) trained on adjusted ClimSim data with subtracted radiative tendencies, and (b) integrated into ICON-A. The NN parameterization predicts its own error, enabling mixing with a conventional convection scheme when confidence is low, thus making the hybrid AI-physics model tunable with respect to observations and reanalysis through mixing parameters. This improves process understanding by constraining convective tendencies across column water vapor, lower-tropospheric stability, and geographical conditions, yielding interpretable regime behavior. In AMIP-style setups, several hybrid configurations outperform the default convection scheme (e.g., improved precipitation statistics). With additive input noise during training, both hybrid and pure-ML schemes lead to stable simulations and remain physically consistent for at least 20 years.
