EnScale: Temporally-consistent multivariate generative downscaling via proper scoring rules
Maybritt Schillinger, Maxim Samarin, Xinwei Shen, Reto Knutti, Nicolai Meinshausen
TL;DR
EnScale tackles the challenge of generating high-resolution, temporally coherent, multivariate climate fields conditioned on coarse GCM outputs. It introduces a two-step downscaling framework with coarse correction (p_{Z|X}) followed by a progressive multistage super-resolution (p_{Y|Z}), trained with the energy score, a proper multivariate scoring rule. The temporal extension EnScale-t adds autoregressive time-consistency, and a sparse local stochastic architecture enables scalable, location-specific variability modeling. Across multiple GCM–RCM pairs and four climate variables, EnScale achieves strong calibration, realistic spatial structure, reliable extremes, and favorable multivariate dependencies, while reducing computational cost by about an order of magnitude relative to diffusion-baseline methods. The work provides a comprehensive evaluation framework and demonstrates that stochastic, temporally-consistent emulation of RCMs is feasible and practically impactful for regional climate impact assessments.
Abstract
The practical use of future climate projections from global circulation models (GCMs) is often limited by their coarse spatial resolution, requiring downscaling to generate high-resolution data. Regional climate models (RCMs) provide this refinement, but are computationally expensive. To address this issue, machine learning models can learn the downscaling function, mapping coarse GCM outputs to high-resolution fields. Among these, generative approaches aim to capture the full conditional distribution of RCM data given coarse-scale GCM data, which is characterized by large variability and thus challenging to model accurately. We introduce EnScale, a generative machine learning framework that emulates the full GCM-to-RCM map by training on multiple pairs of GCM and corresponding RCM data. It first adjusts large-scale mismatches between GCM and coarsened RCM data, followed by a super-resolution step to generate high-resolution fields. Both steps employ generative models optimized with the energy score, a proper scoring rule. Compared to state-of-the-art ML downscaling approaches, our setup reduces computational cost by about one order of magnitude. EnScale jointly emulates multiple variables -- temperature, precipitation, solar radiation, and wind -- spatially consistent over an area in Central Europe. In addition, we propose a variant EnScale-t that enables temporally consistent downscaling. We establish a comprehensive evaluation framework across various categories including calibration, spatial structure, extremes, and multivariate dependencies. Comparison with diverse benchmarks demonstrates EnScale's strong performance and computational efficiency. EnScale offers a promising approach for accurate and temporally consistent RCM emulation.
