Input-to-State Stable Coupled Oscillator Networks for Closed-form Model-based Control in Latent Space
Maximilian Stölzle, Cosimo Della Santina
TL;DR
This work tackles latent-space control of physical systems from high-dimensional observations by introducing Input-to-State Stable Coupled Oscillator Networks (CON), a Lagrangian-formulated, ISS-stable latent-dynamics model for learning dynamics and enabling model-based control in latent space. The authors prove Global Asymptotic Stability for the unforced CON via a Lyapunov candidate $V_\mu(\tilde{y}_w)$ and Global Input-to-State Stability for the forced system, while learning in pixels through a $\beta$-VAE and leveraging a potential-energy-based controller that combines feedforward potential compensation with a P-satI-D feedback term. To accelerate training, they derive CFA-CON, an approximate closed-form rollout that decouples linear dynamics from nonlinear residuals, and show CFA-CON can double training speed with minimal loss in accuracy. Empirically, CON and CFA-CON achieve competitive or state-of-the-art prediction across several mechanical and soft-robot datasets, with CON-M and CFA-CON delivering strong results on actuated systems, and a latent-space PID-based controller leveraging the learned potential guiding effective tracking. The work thus provides principled stability guarantees, efficient training, and practical model-based control capabilities directly from image-based observations, with clear avenues for future extension to more complex or non-ideal physical systems.
Abstract
Even though a variety of methods have been proposed in the literature, efficient and effective latent-space control (i.e., control in a learned low-dimensional space) of physical systems remains an open challenge. We argue that a promising avenue is to leverage powerful and well-understood closed-form strategies from control theory literature in combination with learned dynamics, such as potential-energy shaping. We identify three fundamental shortcomings in existing latent-space models that have so far prevented this powerful combination: (i) they lack the mathematical structure of a physical system, (ii) they do not inherently conserve the stability properties of the real systems, (iii) these methods do not have an invertible mapping between input and latent-space forcing. This work proposes a novel Coupled Oscillator Network (CON) model that simultaneously tackles all these issues. More specifically, (i) we show analytically that CON is a Lagrangian system - i.e., it possesses well-defined potential and kinetic energy terms. Then, (ii) we provide formal proof of global Input-to-State stability using Lyapunov arguments. Moving to the experimental side, we demonstrate that CON reaches SoA performance when learning complex nonlinear dynamics of mechanical systems directly from images. An additional methodological innovation contributing to achieving this third goal is an approximated closed-form solution for efficient integration of network dynamics, which eases efficient training. We tackle (iii) by approximating the forcing-to-input mapping with a decoder that is trained to reconstruct the input based on the encoded latent space force. Finally, we show how these properties enable latent-space control. We use an integral-saturated PID with potential force compensation and demonstrate high-quality performance on a soft robot using raw pixels as the only feedback information.
