Measure-Theoretic Time-Delay Embedding
Jonah Botvinick-Greenhouse, Maria Oprea, Romit Maulik, Yunan Yang
TL;DR
This work generalizes Takens' time-delay embedding from trajectories to distributions by formulating a measure-theoretic embedding between probability spaces using pushforwards, grounded in optimal transport on $\mathcal{P}_2(M)$. It establishes a main theorem showing that if the classical delay map is an embedding, its pushforward to probability measures is also an embedding, enabling a robust measure-valued reconstruction of full system states. The authors develop a computational procedure that learns the inverse embedding via a measure-theoretic loss (e.g., $\mathcal{L}_{\text{m}}$ using MMD) and cluster-based empirical measures to handle sparse/noisy data. Demonstrations on Lorenz-63 and related chaotic systems, plus NOAA SST and ERA5 wind datasets, show that the measure-based approach yields smoother reconstructions and better generalization under noise and data scarcity, suggesting a practical, scalable alternative to traditional pointwise learning in data-driven dynamics.
Abstract
The celebrated Takens' embedding theorem provides a theoretical foundation for reconstructing the full state of a dynamical system from partial observations. However, the classical theorem assumes that the underlying system is deterministic and that observations are noise-free, limiting its applicability in real-world scenarios. Motivated by these limitations, we formulate a measure-theoretic generalization that adopts an Eulerian description of the dynamics and recasts the embedding as a pushforward map between spaces of probability measures. Our mathematical results leverage recent advances in optimal transport. Building on the proposed measure-theoretic time-delay embedding theory, we develop a computational procedure that aims to reconstruct the full state of a dynamical system from time-lagged partial observations, engineered with robustness to handle sparse and noisy data. We evaluate our measure-based approach across several numerical examples, ranging from the classic Lorenz-63 system to real-world applications such as NOAA sea surface temperature reconstruction and ERA5 wind field reconstruction.
