Neural Lyapunov Differentiable Predictive Control
Sayak Mukherjee, Ján Drgoňa, Aaron Tuor, Mahantesh Halappanavar, Draguna Vrabie
TL;DR
This work tackles the computational burden of model predictive control by introducing Neural Lyapunov Differentiable Predictive Control (NLDPC), which jointly learns an explicit neural predictive policy and a neural Lyapunov function within a differentiable programming framework. The approach builds a differentiable closed-loop graph that integrates dynamics, costs, and Lyapunov constraints, enabling offline training via automatic differentiation and gradient-based optimization. It provides probabilistic stability guarantees using Hoeffding-based bounds over sampled initial conditions and demonstrates scalability through two numerical case studies: a double integrator and a PVTOL aircraft model. The results show that the learned Lyapunov function can expand the safe operating region while ensuring constraint satisfaction with reduced online computation, highlighting a practical path toward stability-certified, data-driven MPC for nonlinear systems.
Abstract
We present a learning-based predictive control methodology using the differentiable programming framework with probabilistic Lyapunov-based stability guarantees. The neural Lyapunov differentiable predictive control (NLDPC) learns the policy by constructing a computational graph encompassing the system dynamics, state and input constraints, and the necessary Lyapunov certification constraints, and thereafter using the automatic differentiation to update the neural policy parameters. In conjunction, our approach jointly learns a Lyapunov function that certifies the regions of state-space with stable dynamics. We also provide a sampling-based statistical guarantee for the training of NLDPC from the distribution of initial conditions. Our offline training approach provides a computationally efficient and scalable alternative to classical explicit model predictive control solutions. We substantiate the advantages of the proposed approach with simulations to stabilize the double integrator model and on an example of controlling an aircraft model.
