Retrofitting Earth System Models with Cadence-Limited Neural Operator Updates
Aniruddha Bora, Shixuan Zhang, Khemraj Shukla, Bryce Harrop, George Em. Karniadakis, L. Ruby Leung
TL;DR
This work presents cadence-limited online bias corrections for a legacy Earth system model by learning neural-operator updates that map instantaneous states to nudging tendencies. It introduces two FiLM-conditioned UNet-inspired architectures, IUNet and M&M, with multiscale, fully differentiable upsampling designed for online deployment in EAMv2. Offline tests show strong generalization and that M&M offers the best overall fidelity; online integrations demonstrate stable, 2–10% RMSE reductions across key fields, with IUNet and M&M providing the most robust improvements. The study emphasizes stability, portability, and computational feasibility, laying groundwork for extended training and online-learning strategies to further retrofit legacy climate models with expressive neural operators.
Abstract
Coarse resolution, imperfect parameterizations, and uncertain initial states and forcings limit Earth-system model (ESM) predictions. Traditional bias correction via data assimilation improves constrained simulations but offers limited benefit once models run freely. We introduce an operator-learning framework that maps instantaneous model states to bias-correction tendencies and applies them online during integration. Building on a U-Net backbone, we develop two operator architectures Inception U-Net (IUNet) and a multi-scale network (M\&M) that combine diverse upsampling and receptive fields to capture multiscale nonlinear features under Energy Exascale Earth System Model (E3SM) runtime constraints. Trained on two years E3SM simulations nudged toward ERA5 reanalysis, the operators generalize across height levels and seasons. Both architectures outperform standard U-Net baselines in offline tests, indicating that functional richness rather than parameter count drives performance. In online hybrid E3SM runs, M\&M delivers the most consistent bias reductions across variables and vertical levels. The ML-augmented configurations remain stable and computationally feasible in multi-year simulations, providing a practical pathway for scalable hybrid modeling. Our framework emphasizes long-term stability, portability, and cadence-limited updates, demonstrating the utility of expressive ML operators for learning structured, cross-scale relationships and retrofitting legacy ESMs.
