Reconstructing the Tropical Pacific Upper Ocean using Online Data Assimilation with a Deep Learning model
Zilu Meng, Gregory J. Hakim
TL;DR
This work develops and tests a transformer-based deep learning (DL) model for forecasting and reconstructing tropical Pacific upper-ocean states from sparse coral proxies, comparing it against a Linear Inverse Model (LIM). Trained on CMIP6 historical data and validated with SODA and GODAS reanalysis, the DL model forecasts 12 months ahead and forms priors for Ensemble Kalman Filter data assimilation; to address forecast variance loss, a inflation scheme injects hindcast-derived noise scaled by inter-model envelope. Across online assimilation experiments with 24 pseudo-proxies, DL yields higher domain-averaged correlations and better Nino3.4 reconstructions than LIM and offline baselines, with gains increasing as observation-averaging time grows. The results support DL as a computationally efficient priors approach for online paleoclimate data assimilation and motivate applying the method to real proxy data and broader regions.
Abstract
A deep learning (DL) model, based on a transformer architecture, is trained on a climate-model dataset and compared with a standard linear inverse model (LIM) in the tropical Pacific. We show that the DL model produces more accurate forecasts compared to the LIM when tested on a reanalysis dataset. We then assess the ability of an ensemble Kalman filter to reconstruct the monthly-averaged upper ocean from a noisy set of 24 sea-surface temperature observations designed to mimic existing coral proxy measurements, and compare results for the DL model and LIM. Due to signal damping in the DL model, we implement a novel inflation technique by adding noise from hindcast experiments. Results show that assimilating observations with the DL model yields better reconstructions than the LIM for observation averaging times ranging from one month to one year. The improved reconstruction is due to the enhanced predictive capabilities of the DL model, which map the memory of past observations to future assimilation times.
