Distillation and Interpretability of Ensemble Forecasts of ENSO Phase using Entropic Learning
Michael Groom, Davide Bassetti, Illia Horenko, Terence J. O'Kane
TL;DR
This work addresses the interpretability gap in state-of-the-art ENSO forecasts produced by ensembles of entropy-optimal eSPA models trained on observational/reanalysis data. By distilling successful ensemble members into lead-time specific superclusters and associated affiliation/transition structures, the authors retain probabilistic skill while enabling rigorous diagnostics, including spatial importance maps and reconstructed precursor pathways. The distilled models show RPSS comparable to or better than baselines for long lead times, reveal physically consistent precursors, and trace canonical pathways from extratropical/remote signals to mature ENSO states. The approach provides a practical, diagnostically rich framework that complements real-time forecasts and deepens understanding of long-range ENSO predictability, with potential extensions to operational settings and broader fields.
Abstract
This paper introduces a distillation framework for an ensemble of entropy-optimal Sparse Probabilistic Approximation (eSPA) models, trained exclusively on satellite-era observational and reanalysis data to predict ENSO phase up to 24 months in advance. While eSPA ensembles yield state-of-the-art forecast skill, they are harder to interpret than individual eSPA models. We show how to compress the ensemble into a compact set of "distilled" models by aggregating the structure of only those ensemble members that make correct predictions. This process yields a single, diagnostically tractable model for each forecast lead time that preserves forecast performance while also enabling diagnostics that are impractical to implement on the full ensemble. An analysis of the regime persistence of the distilled model "superclusters", as well as cross-lead clustering consistency, shows that the discretised system accurately captures the spatiotemporal dynamics of ENSO. By considering the effective dimension of the feature importance vectors, the complexity of the input space required for correct ENSO phase prediction is shown to peak when forecasts must cross the boreal spring predictability barrier. Spatial importance maps derived from the feature importance vectors are introduced to identify where predictive information resides in each field and are shown to include known physical precursors at certain lead times. Case studies of key events are also presented, showing how fields reconstructed from distilled model centroids trace the evolution from extratropical and inter-basin precursors to the mature ENSO state. Overall, the distillation framework enables a rigorous investigation of long-range ENSO predictability that complements real-time data-driven operational forecasts.
