Learning to Stabilize High-dimensional Unknown Systems Using Lyapunov-guided Exploration
Songyuan Zhang, Chuchu Fan
TL;DR
The paper tackles stabilizing high-dimensional unknown dynamical systems where full dynamics are unavailable or impractical to model. It introduces LYGE, a framework that learns local dynamics, a neural Control Lyapunov Function, and a stabilizing controller while Lyapunov-guided exploration selectively collects data within a growing trusted region toward the goal, ensuring stability within convergence. Empirically, LYGE achieves comparable or better stabilization than RL/IL baselines across six environments, notably reducing sample requirements by roughly 68%–95%, and scales to complex 16D/4D F-16 models. The approach also extends to learning alternative certificates, such as Control Contraction Metrics, highlighting its generality for stabilizing unknown systems where dynamics are partially learned. Overall, LYGE offers a practical, scalable route to data-efficient stabilization without requiring a full dynamical model, with promising avenues for formal verification and certificate-based extensions.
Abstract
Designing stabilizing controllers is a fundamental challenge in autonomous systems, particularly for high-dimensional, nonlinear systems that can hardly be accurately modeled with differential equations. The Lyapunov theory offers a solution for stabilizing control systems, still, current methods relying on Lyapunov functions require access to complete dynamics or samples of system executions throughout the entire state space. Consequently, they are impractical for high-dimensional systems. This paper introduces a novel framework, LYapunov-Guided Exploration (LYGE), for learning stabilizing controllers tailored to high-dimensional, unknown systems. LYGE employs Lyapunov theory to iteratively guide the search for samples during exploration while simultaneously learning the local system dynamics, control policy, and Lyapunov functions. We demonstrate its scalability on highly complex systems, including a high-fidelity F-16 jet model featuring a 16D state space and a 4D input space. Experiments indicate that, compared to prior works in reinforcement learning, imitation learning, and neural certificates, LYGE reduces the distance to the goal by 50% while requiring only 5% to 32% of the samples. Furthermore, we demonstrate that our algorithm can be extended to learn controllers guided by other certificate functions for unknown systems.
