Safe and Stable Control via Lyapunov-Guided Diffusion Models
Xiaoyuan Cheng, Xiaohang Tang, Yiming Yang
TL;DR
This work tackles the challenge of achieving safety and stability in diffusion-based control policies. It introduces Safe and Stable Diffusion (S^2Diff), a model-based diffusion planning framework that learns certificate functions inspired by Almost Lyapunov theory and uses diffusion sampling to generate trajectory-level policies without gradient-based QP constraints or control-affine assumptions. A probabilistic CLBF guided diffusion process ensures safety and stability by shaping a target distribution over trajectories and updating the CLBF from sampled data, with theoretical guarantees of almost-sure exponential convergence outside small dissipation-violating regions. Empirically, S^2Diff outperforms gradient-based certificate methods and model-based diffusion baselines across a range of nonlinear dynamical systems, demonstrating higher safety rates, better stability, and favorable evaluation times. The approach provides a flexible path to robust, safe learning-based control with potential for extension to richer neural certificate representations and faster inference through distillation.
Abstract
Diffusion models have made significant strides in recent years, exhibiting strong generalization capabilities in planning and control tasks. However, most diffusion-based policies remain focused on reward maximization or cost minimization, often overlooking critical aspects of safety and stability. In this work, we propose Safe and Stable Diffusion ($S^2$Diff), a model-based diffusion framework that explores how diffusion models can ensure safety and stability from a Lyapunov perspective. We demonstrate that $S^2$Diff eliminates the reliance on both complex gradient-based solvers (e.g., quadratic programming, non-convex solvers) and control-affine structures, leading to globally valid control policies driven by the learned certificate functions. Additionally, we uncover intrinsic connections between diffusion sampling and Almost Lyapunov theory, enabling the use of trajectory-level control policies to learn better certificate functions for safety and stability guarantees. To validate our approach, we conduct experiments on a wide variety of dynamical control systems, where $S^2$Diff consistently outperforms both certificate-based controllers and model-based diffusion baselines in terms of safety, stability, and overall control performance.
