Stochastic Latent Transformer: Efficient Modelling of Stochastically Forced Zonal Jets
Ira J. S. Shokar, Rich R. Kerswell, Peter H. Haynes
TL;DR
The paper addresses efficient probabilistic modelling of stochastically forced zonal jets governed by SPDEs, focusing on the mean-flow field U(y,t) and its interaction with eddies. It introduces the Stochastic Latent Transformer (SLT), which combines a translation-equivariant autoencoder (TEPC) for a phase-aligned latent representation Z with a stochastic transformer that evolves Z via Z_{t+1} = \mathcal{T}_\varphi[Z_{t:t-L}, \epsilon], trained using CRPS and a spectral loss. The approach yields faithful short- and long-term statistics, matching spectral properties and transition-event distributions while delivering over five orders of magnitude speedup, enabling large ensembles for uncertainty quantification of spontaneous jet-transition events. This scalable framework invites extensions to higher-dimensional geophysical flows and transfer learning across regimes, facilitating robust, data-driven exploration of stochastic turbulence systems.
Abstract
We present a novel probabilistic deep learning approach, the 'Stochastic Latent Transformer' (SLT), designed for the efficient reduced-order modelling of stochastic partial differential equations. Stochastically driven flow models are pertinent to a diverse range of natural phenomena, including jets on giant planets, ocean circulation, and the variability of midlatitude weather. However, much of the recent progress in deep learning has predominantly focused on deterministic systems. The SLT comprises a stochastically-forced transformer paired with a translation-equivariant autoencoder, trained towards the Continuous Ranked Probability Score. We showcase its effectiveness by applying it to a well-researched zonal jet system, where the interaction between stochastically forced eddies and the zonal mean flow results in a rich low-frequency variability. The SLT accurately reproduces system dynamics across various integration periods, validated through quantitative diagnostics that include spectral properties and the rate of transitions between distinct states. The SLT achieves a five-order-of-magnitude speedup in emulating the zonally-averaged flow compared to direct numerical simulations. This acceleration facilitates the cost-effective generation of large ensembles, enabling the exploration of statistical questions concerning the probabilities of spontaneous transition events.
