Learning invariant representations of time-homogeneous stochastic dynamical systems
Vladimir R. Kostic, Pietro Novelli, Riccardo Grazzi, Karim Lounici, Massimiliano Pontil
TL;DR
The paper addresses learning invariant representations for time-homogeneous stochastic dynamics to enable accurate estimation of the transfer operator $\mathcal{T}$ or generator $\mathcal{L}$. It introduces Deep Projection Networks (DPNets) that optimize projection-based scores $\mathcal{P}$ and $\mathcal{S}^\gamma$ to identify leading invariant subspaces, with a differentiable relaxation that improves numerical stability and a continuous-time extension for time-reversal invariant systems. Theoretical results show that these scores capture the leading invariant subspace under compactness (and in the continuous case under self-adjointness) and link representation learning with operator regression (EDMD) for forecasting and spectral analysis. Extensive experiments across logistic maps, MNIST-like dynamics, fluid flow, and molecular systems demonstrate robustness and scalability, highlighting the practical impact for data-driven dynamical systems.
Abstract
We consider the general class of time-homogeneous stochastic dynamical systems, both discrete and continuous, and study the problem of learning a representation of the state that faithfully captures its dynamics. This is instrumental to learning the transfer operator or the generator of the system, which in turn can be used for numerous tasks, such as forecasting and interpreting the system dynamics. We show that the search for a good representation can be cast as an optimization problem over neural networks. Our approach is supported by recent results in statistical learning theory, highlighting the role of approximation error and metric distortion in the learning problem. The objective function we propose is associated with projection operators from the representation space to the data space, overcomes metric distortion, and can be empirically estimated from data. In the discrete-time setting, we further derive a relaxed objective function that is differentiable and numerically well-conditioned. We compare our method against state-of-the-art approaches on different datasets, showing better performance across the board.
