Lyapunov Stability Learning with Nonlinear Control via Inductive Biases
Yupu Lu, Shijie Lin, Hao Xu, Zeqing Zhang, Jia Pan
TL;DR
This work tackles stability guarantees for nonlinear control by learning neural control Lyapunov functions (CLFs) and CLF-based controllers within an end-to-end framework. It reframes Lyapunov conditions as inductive biases, enabling a self-supervised learning pipeline that jointly estimates system dynamics, a CLF, and a bounded controller, while integrating verification into training. The proposed sum-of-squares neural CLF with a bounded controller, coupled with a Lyapunov-based loss and optional geometric shaping, yields higher convergence rates and larger regions of attraction (ROA) than prior approaches, as demonstrated on unicycle PF, inverted pendulum, and extensions to more complex 4-DOF and 6-DOF systems. This approach simplifies implementation by avoiding external verifiers and improves robustness and scalability for safety-critical nonlinear control applications.
Abstract
Finding a control Lyapunov function (CLF) in a dynamical system with a controller is an effective way to guarantee stability, which is a crucial issue in safety-concerned applications. Recently, deep learning models representing CLFs have been applied into a learner-verifier framework to identify satisfiable candidates. However, the learner treats Lyapunov conditions as complex constraints for optimisation, which is hard to achieve global convergence. It is also too complicated to implement these Lyapunov conditions for verification. To improve this framework, we treat Lyapunov conditions as inductive biases and design a neural CLF and a CLF-based controller guided by this knowledge. This design enables a stable optimisation process with limited constraints, and allows end-to-end learning of both the CLF and the controller. Our approach achieves a higher convergence rate and larger region of attraction (ROA) in learning the CLF compared to existing methods among abundant experiment cases. We also thoroughly reveal why the success rate decreases with previous methods during learning.
