An intercomparison of generative machine learning methods for downscaling precipitation at fine spatial scales
Bryn Ward-Leikis, Neelesh Rampal, Yun Sing Koh, Peter B. Gibson, Hong-Yang Liu, Vassili Kitsios, Tristan Meyers, Jeff Adie, Yang Juntao, Steven C. Sherwood
TL;DR
This study benchmarks residual generative downscaling methods for daily precipitation over New Zealand, comparing a residual conditional GAN with an intensity constraint to residual diffusion models against a deterministic baseline within a perfect-model framework. Using a 12 km CCAM-RCM emulator trained on CMIP6 data, it evaluates performance across historical and future climates with metrics capturing mean climatologies, extremes, spatial structure, and climate-change signals. Diffusion models deliver stronger fine-scale spatial realism and dry-spell representation but tend to underpredict extreme precipitation changes, while the cGAN matches or surpasses in most metrics, including extremes and climate-change responses, at substantially lower computational cost. The results highlight that appearance of spatial realism does not guarantee reliable climate-change extrapolation, and they suggest that incorporating physics-based constraints into diffusion models or further tuning could yield robust, efficient downscaling suitable for climate risk assessment and ensemble projections.
Abstract
Machine learning (ML) offers a computationally efficient approach for generating large ensembles of high-resolution climate projections, but deterministic ML methods often smooth fine-scale structures and underestimate extremes. While stochastic generative models show promise for predicting fine-scale weather and extremes, few studies have compared their performance under present-day and future climates. This study compares a previously developed conditional Generative Adversarial Network (cGAN) with an intensity constraint against different configurations of diffusion models for downscaling daily precipitation from a regional climate model (RCM) over Aotearoa New Zealand. Model skill is comprehensively assessed across spatial structure, distributional metrics, means, extremes, and their respective climate change signals. Both generative approaches outperform the deterministic baseline across most metrics and exhibit similar overall skill. Diffusion models better predict the fine-scale spatial structure of precipitation and the length of dry spells, but underestimate climate change signals for extreme precipitation compared to the ground truth RCMs. In contrast, cGANs achieve comparable skill for most metrics while better predicting the overall precipitation distribution and climate change responses for extremes at a fraction of the computational cost. These results demonstrate that while diffusion models can readily generate predictions with greater visual "realism", they do not necessarily better preserve climate change responses compared to cGANs with intensity constraints. At present, incorporating constraints into diffusion models remains challenging compared to cGANs, but may represent an opportunity to further improve skill for predicting climate change responses.
