Gradient-free online learning of subgrid-scale dynamics with neural emulators
Hugo Frezat, Ronan Fablet, Guillaume Balarac, Julien Le Sommer
TL;DR
The paper addresses online training of subgrid-scale (SGS) parametrizations for non-differentiable climate solvers by introducing a neural emulator of the coarse solver to enable gradient-based optimization. It presents a two-step training scheme that separately learns a differentiable emulator and an SGS parametrization with compensated losses to minimize bias arising from emulator errors. Demonstrations on a chaotic two-timescale Lorenz-96 system and a barotropic quasi-geostrophic flow show that emulator-based online training can approach the performance of true online training without requiring solver adjoints, with emulator quality and loss design playing pivotal roles. The work highlights potential for applying gradient-free online strategies to larger, production-scale climate models and motivates further improvements in emulator architectures and evaluation metrics. Overall, it advances a practical route toward stable, data-driven SGS closures in earth-system modeling.
Abstract
In this paper, we propose a generic algorithm to train machine learning-based subgrid parametrizations online, i.e., with \textit{a posteriori} loss functions, but for non-differentiable numerical solvers. The proposed approach leverages a neural emulator to approximate the reduced state-space solver, which is then used to allow gradient propagation through temporal integration steps. We apply this methodology on a chaotic two-timescales Lorenz-96 system and a single layer quasi-geostrophic system with zonal dynamics, known to be highly unstable with offline strategies. Using our algorithm, we are able to train a parametrization that recovers most of the benefits of online strategies without having to compute the gradient of the original solver. We found that training the neural emulator and parametrization components separately with different loss quantities is necessary in order to minimize the propagation of approximation biases. Experiments on emulator architectures with different complexities also indicates that emulator performance is key in order to learn an accurate parametrization. This work is a step towards learning parametrization with online strategies for climate models.
