Training neural operators to preserve invariant measures of chaotic attractors
Ruoxi Jiang, Peter Y. Lu, Elena Orlova, Rebecca Willett
TL;DR
The paper tackles the problem of unstable long-horizon forecasting for chaotic dynamics by shifting the learning objective from trajectory RMSE to preserving invariant measures and time-invariant statistics on chaotic attractors. It introduces two complementary training paradigms in a multi-environment setting: (i) a physics-informed optimal transport loss that matches distributions of carefully chosen summary statistics using the entropy-regularized 2-Wasserstein distance $W^\gamma$ (Sinkhorn) and a bias-corrected Sinkhorn divergence, and (ii) a contrastive feature-learning loss that automatically extracts invariant statistics with an encoder via InfoNCE; both are combined with short-term RMSE. Empirical results on Lorenz-96 and Kuramoto–Sivashinsky show that OT and CL approaches yield substantially better long-term statistical fidelity (histograms, energy spectra, Lyapunov-related metrics) than RMSE alone, while maintaining competitive short-term forecasts. This work advances practical neural-operator emulation for chaotic systems, with potential impacts on climate modeling and turbulence where capturing long-term statistical structure is essential.
Abstract
Chaotic systems make long-horizon forecasts difficult because small perturbations in initial conditions cause trajectories to diverge at an exponential rate. In this setting, neural operators trained to minimize squared error losses, while capable of accurate short-term forecasts, often fail to reproduce statistical or structural properties of the dynamics over longer time horizons and can yield degenerate results. In this paper, we propose an alternative framework designed to preserve invariant measures of chaotic attractors that characterize the time-invariant statistical properties of the dynamics. Specifically, in the multi-environment setting (where each sample trajectory is governed by slightly different dynamics), we consider two novel approaches to training with noisy data. First, we propose a loss based on the optimal transport distance between the observed dynamics and the neural operator outputs. This approach requires expert knowledge of the underlying physics to determine what statistical features should be included in the optimal transport loss. Second, we show that a contrastive learning framework, which does not require any specialized prior knowledge, can preserve statistical properties of the dynamics nearly as well as the optimal transport approach. On a variety of chaotic systems, our method is shown empirically to preserve invariant measures of chaotic attractors.
