Koopman-Equivariant Gaussian Processes
Petar Bevanda, Max Beier, Armin Lederer, Alexandre Capone, Stefan Sosnowski, Sandra Hirche
TL;DR
This work introduces Koopman-Equivariant Gaussian Processes (KE-GPs), a probabilistic framework that blends Gaussian process regression with Koopman operator structure to model dynamical systems whose evolution is linear in time but nonlinear in initial conditions. By encoding Koopman equivariance through a trajectory-based symmetrization and a spectral kernel construction, KE-GPs achieve closed-form multi-step trajectory posteriors with quantified uncertainty and improved generalization. The authors provide a theoretical analysis showing reduced sample complexity via information-gain rates, and develop a scalable variational inference scheme with inducing trajectories that leverages the KE structure to avoid time-context inducing points. Empirical results on predator-prey dynamics, robotic demonstrations, and weather data demonstrate competitive or superior forecasting and robust uncertainty quantification compared to kernel-based GP methods and Koopman regression baselines. This framework offers a principled, scalable approach for learning dynamical representations with tractable probabilistic forecasts and representations.
Abstract
Credible forecasting and representation learning of dynamical systems are of ever-increasing importance for reliable decision-making. To that end, we propose a family of Gaussian processes (GP) for dynamical systems with linear time-invariant responses, which are nonlinear only in initial conditions. This linearity allows us to tractably quantify forecasting and representational uncertainty, simultaneously alleviating the challenge of computing the distribution of trajectories from a GP-based dynamical system and enabling a new probabilistic treatment of learning Koopman operator representations. Using a trajectory-based equivariance -- which we refer to as \textit{Koopman equivariance} -- we obtain a GP model with enhanced generalization capabilities. To allow for large-scale regression, we equip our framework with variational inference based on suitable inducing points. Experiments demonstrate on-par and often better forecasting performance compared to kernel-based methods for learning dynamical systems.
