Representing Subgrid-Scale Cloud Effects in a Radiation Parameterization using Machine Learning: MLe-radiation v1.0
Katharina Hafner, Sara Shamekh, Guillaume Bertoli, Axel Lauer, Robert Pincus, Julien Savre, Veronika Eyring
TL;DR
The paper tackles large uncertainties in radiative transfer due to subgrid-scale clouds in coarse-resolution Earth System Models by learning the cloud radiative impact (CRI) from high-resolution, global-storm-resolving simulations. It introduces a hybrid physics-ML radiation parameterization in which a BiLSTM neural network predicts the cloud contribution to heating rates from vertical state profiles, while a conventional physics-based scheme provides clear-sky fluxes; CRI is defined as the difference between all-sky and clear-sky heating rates, $\frac{\partial T_{CRI}}{\partial t} = \frac{\partial T_{all-sky}}{\partial t} - \frac{\partial T_{clear-sky}}{\partial t}$. Training uses 5 km QUBICC data coarse-grained to target resolutions (~80 km) and is evaluated against a McICA-based pyRTE+RRTMGP baseline across shortwave and longwave spectra, including fully and partially cloudy conditions and across multiple regions. The results show substantial reductions in heating-rate errors (factors of 4–11) with the ML-enhanced scheme, demonstrating the potential to encode subgrid-cloud variability into radiation schemes for next-generation Earth System Models. The study also discusses limitations due to unresolved shallow convection and aerosol absence in high-resolution data, and outlines online deployment and aerosol integration as future directions, highlighting a path toward more accurate, generalizable radiation parameterizations with potential computational savings.
Abstract
Improvements of Machine Learning (ML)-based radiation emulators remain constrained by the underlying assumptions to represent horizontal and vertical subgrid-scale cloud distributions, which continue to introduce substantial uncertainties. In this study, we introduce a method to represent the impact of subgrid-scale clouds by applying ML to learn processes from high-resolution model output with a horizontal grid spacing of 5km. In global storm resolving models, clouds begin to be explicitly resolved. Coarse-graining these high-resolution simulations to the resolution of coarser Earth System Models yields radiative heating rates that implicitly include subgrid-scale cloud effects, without assumptions about their horizontal or vertical distributions. We define the cloud radiative impact as the difference between all-sky and clear-sky radiative fluxes, and train the ML component solely on this cloud-induced contribution to heating rates. The clear-sky tendencies remain being computed with a conventional physics-based radiation scheme. This hybrid design enhances generalization, since the machine-learned part addresses only subgrid-scale cloud effects, while the clear-sky component remains responsive to changes in greenhouse gas or aerosol concentrations. Applied to coarse-grained data offline, the ML-enhanced radiation scheme reduces errors by a factor of 4-10 compared with a conventional coarse-scale radiation scheme. This shows the potential of representing subgrid-scale cloud effects in radiation schemes with ML for the next generation of Earth System Models.
