Beyond Ensemble Averages: Leveraging Climate Model Ensembles for Subseasonal Forecasting
Elena Orlova, Haokun Liu, Raphael Rossellini, Benjamin A. Cash, Rebecca Willett
TL;DR
This work tackles subseasonal forecasting by treating lagged physics-based ensemble forecasts as rich features for ML post-processing. By incorporating the full ensemble (not just the mean), lagged observations, SST PCs, and spatial encodings within LR, RF, and U-Net architectures, and by employing nonlinear stacking, the authors systematically improve predictions of monthly precipitation and 2 m temperature two weeks ahead over the CONUS. Across regression, quantile, and tercile tasks, the stacked models consistently outperform climatology and ensemble-mean baselines, with notable gains in temperature forecasts and extreme-event prediction. The study demonstrates the value of ensemble diversity and spatial information for SSF and outlines promising future directions such as transformers and uncertainty quantification to further enhance operational relevance.
Abstract
Producing high-quality forecasts of key climate variables, such as temperature and precipitation, on subseasonal time scales has long been a gap in operational forecasting. This study explores an application of machine learning (ML) models as post-processing tools for subseasonal forecasting. Lagged numerical ensemble forecasts (i.e., an ensemble where the members have different initialization dates) and observational data, including relative humidity, pressure at sea level, and geopotential height, are incorporated into various ML methods to predict monthly average precipitation and two-meter temperature two weeks in advance for the continental United States. For regression, quantile regression, and tercile classification tasks, we consider using linear models, random forests, convolutional neural networks, and stacked models (a multi-model approach based on the prediction of the individual ML models). Unlike previous ML approaches that often use ensemble mean alone, we leverage information embedded in the ensemble forecasts to enhance prediction accuracy. Additionally, we investigate extreme event predictions that are crucial for planning and mitigation efforts. Considering ensemble members as a collection of spatial forecasts, we explore different approaches to using spatial information. Trade-offs between different approaches may be mitigated with model stacking. Our proposed models outperform standard baselines such as climatological forecasts and ensemble means. In addition, we investigate feature importance, trade-offs between using the full ensemble or only the ensemble mean, and different modes of accounting for spatial variability.
