Continuity-Preserving Convolutional Autoencoders for Learning Continuous Latent Dynamical Models from Images
Aiqing Zhu, Yuting Pan, Qianxiao Li
TL;DR
The paper tackles learning continuous latent dynamical models from sequences of image observations, where pixel data are discrete yet the underlying dynamics are continuous. It introduces continuity-preserving autoencoders (CpAEs) that enforce δ-continuity by constraining early CNN filters to be Lipschitz and by employing a nonlocal regularizer to promote smooth filter changes, ensuring that latent states $Z$ evolve coherently with the true dynamics. A theoretical result connects δ-continuity of the encoder to the continuity of the latent trajectories, and the model is trained in two steps: (i) learn a CpAE with a continuity-aware loss, and (ii) fit a continuous latent dynamic model (e.g., Neural ODE) for $Z$. Empirical evaluation on synthetic and real-world motion datasets shows CpAEs yield continuous latent representations and improved predictive performance over standard autoencoders and several baselines, with potential for future improvements via hypernetworks and transformer-based architectures.
Abstract
Continuous dynamical systems are cornerstones of many scientific and engineering disciplines. While machine learning offers powerful tools to model these systems from trajectory data, challenges arise when these trajectories are captured as images, resulting in pixel-level observations that are discrete in nature. Consequently, a naive application of a convolutional autoencoder can result in latent coordinates that are discontinuous in time. To resolve this, we propose continuity-preserving convolutional autoencoders (CpAEs) to learn continuous latent states and their corresponding continuous latent dynamical models from discrete image frames. We present a mathematical formulation for learning dynamics from image frames, which illustrates issues with previous approaches and motivates our methodology based on promoting the continuity of convolution filters, thereby preserving the continuity of the latent states. This approach enables CpAEs to produce latent states that evolve continuously with the underlying dynamics, leading to more accurate latent dynamical models. Extensive experiments across various scenarios demonstrate the effectiveness of CpAEs.
