Autoencoding Dynamics: Topological Limitations and Capabilities
Matthew D. Kvalheim, Eduardo D. Sontag
TL;DR
This work addresses whether low-dimensional dynamical systems on a manifold $M$ can be faithfully represented by autoencoders with latent space $\mathbb{R}^\ell$. It proves strong topological limitations (e.g., when $m>\ell$ there is a global obstruction to perfect round-trips) and, for the equal-dimension static case, establishes large-measure regions on $M$ where ideal encodings exist via contractible neighborhoods and local encoder/decoder pairs, with $L^p$-losses arbitrarily small. Extending to dynamics, the authors show that one can intertwine the true flow on $M$ with a latent flow on $\mathbb{R}^m$ on these regions, and, under Euclidean-covering conditions, achieve large-time intertwinings and linear latent behavior on basins of attraction. The paper also provides a concrete S^1 example with a 1D latent space, detailing a two-phase training procedure and a loss-function design to realize static and dynamic encodings, illustrating practical feasibility alongside the theoretical limits. Overall, the results bridge topology, geometry, and learning to delineate what autoencoders can and cannot achieve for manifold-based dynamical representations and point toward topology-respecting architectures and control-oriented latent dynamics.
Abstract
Given a "data manifold" $M\subset \mathbb{R}^n$ and "latent space" $\mathbb{R}^\ell$, an autoencoder is a pair of continuous maps consisting of an "encoder" $E\colon \mathbb{R}^n\to \mathbb{R}^\ell$ and "decoder" $D\colon \mathbb{R}^\ell\to \mathbb{R}^n$ such that the "round trip" map $D\circ E$ is as close as possible to the identity map $\mbox{id}_M$ on $M$. We present various topological limitations and capabilites inherent to the search for an autoencoder, and describe capabilities for autoencoding dynamical systems having $M$ as an invariant manifold.
