A Hybrid Deep-Learning Model for El Niño Southern Oscillation in the Low-Data Regime

Jakob Schloer; Matthew Newman; Jannik Thuemmel; Antonietta Capotondi; Bedartha Goswami

A Hybrid Deep-Learning Model for El Niño Southern Oscillation in the Low-Data Regime

Jakob Schloer, Matthew Newman, Jannik Thuemmel, Antonietta Capotondi, Bedartha Goswami

TL;DR

This study addresses the challenge of long-range ENSO forecasting in data-sparse regimes by combining a cyclostationary Linear Inverse Model (LIM) with a non-Markovian LSTM to learn residual nonlinear dynamics. The LIM-LSTM hybrid, trained on a 2000-year CESM2 pre-industrial control dataset and conditioned on seasonality, achieves higher skill than the LIM alone and competes with fully deep-learning baselines while requiring far less data and far fewer parameters. Key findings show that nonlinear asymmetries between warm and cold ENSO events are effectively captured, especially for leads of 9–18 months in the western tropical Pacific, improving both deterministic and probabilistic forecast metrics. The work demonstrates a data-efficient path to robust S2S climate forecasts and highlights the potential for applying similar hybrids to other low-data regime problems, with domain adaptation to observational data as a promising future direction.

Abstract

While deep-learning models have demonstrated skillful El Niño Southern Oscillation (ENSO) forecasts up to one year in advance, they are predominantly trained on climate model simulations that provide thousands of years of training data at the expense of introducing climate model biases. Simpler Linear Inverse Models (LIMs) trained on the much shorter observational record also make skillful ENSO predictions but do not capture predictable nonlinear processes. This motivates a hybrid approach, combining the LIMs modest data needs with a deep-learning non-Markovian correction of the LIM. For O(100 yr) datasets, our resulting Hybrid model is more skillful than the LIM while also exceeding the skill of a full deep-learning model. Additionally, while the most predictable ENSO events are still identified in advance by the LIM, they are better predicted by the Hybrid model, especially in the western tropical Pacific for leads beyond about 9 months, by capturing the subsequent asymmetric (warm versus cold phases) evolution of ENSO.

A Hybrid Deep-Learning Model for El Niño Southern Oscillation in the Low-Data Regime

TL;DR

Abstract

A Hybrid Deep-Learning Model for El Niño Southern Oscillation in the Low-Data Regime

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (11)