Two-Stage Learning of Stabilizing Neural Controllers via Zubov Sampling and Iterative Domain Expansion

Haoyu Li; Xiangru Zhong; Bin Hu; Huan Zhang

TL;DR

This work tackles the challenge of certifying stability for learning-based neural controllers in continuous-time systems by jointly learning a controller and a Lyapunov certificate. It introduces a two-stage training pipeline built on Zubov's characterization of the region of attraction (ROA): (i) a Zubov-guided ROA estimation stage with dynamic, curriculum-driven domain expansion, followed by (ii) a CEGIS-based refinement that eliminates counterexamples inside the ROA so that the result can be formally verified. The verification component extends the $\alpha,\!\beta$-CROWN framework to propagate bounds through the Jacobian of continuous-time dynamics and uses adaptive region bounds to avoid costly bisection. Empirically, the approach yields ROAs with volume $5$ to $1.5\cdot 10^{5}$ times larger than the baselines, and verification on continuous-time benchmarks is $40$ to $10{,}000$ times faster than the SMT solver dReal, with results supported by multiple seeds and ablation studies. The code is open source.

Abstract

Learning-based neural network (NN) control policies have shown impressive empirical performance. However, obtaining stability guarantees and estimates of the region of attraction of these learned neural controllers is challenging due to the lack of stable and scalable training and verification algorithms. Although previous works in this area have achieved great success, much conservatism remains in their frameworks. In this work, we propose a novel two-stage training framework to jointly synthesize a controller and a Lyapunov function for continuous-time systems. By leveraging a Zubov-inspired region of attraction characterization to directly estimate stability boundaries, we propose a novel training-data sampling strategy and a domain-updating mechanism that significantly reduce the conservatism in training. Moreover, unlike existing works on continuous-time systems that rely on an SMT solver to formally verify the Lyapunov condition, we extend the state-of-the-art neural network verifier $\alpha,\!\beta$-CROWN with the capability of performing automatic bound propagation through the Jacobian of dynamical systems, and we introduce a novel verification scheme that avoids expensive bisection. To demonstrate the effectiveness of our approach, we conduct numerical experiments by synthesizing and verifying controllers on several challenging nonlinear systems across multiple dimensions. We show that our training can yield regions of attraction with volume $5$ to $1.5\cdot 10^{5}$ times larger than the baselines, and that our verification on continuous-time systems can be $40$ to $10{,}000$ times faster than the traditional SMT solver dReal. Our code is available at https://github.com/Verified-Intelligence/Two-Stage_Neural_Controller_Training.
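
To make the Zubov-based characterization and the counterexample search more concrete, the following is a minimal sketch on a toy problem. Everything in it is an illustrative assumption and is not taken from the paper: the 1-D system $\dot{x} = -x + x^{3}$, the candidate $W(x) = x^{2}$, the weight $\Psi(x) = 2x^{2}$, and the helper names. It uses one common form of Zubov's equation, $\nabla W(x) \cdot f(x) = -\Psi(x)\,(1 - W(x))$, under which the region of attraction is the sublevel set $\{x : W(x) < 1\}$. The paper's exact formulation may differ, and random sampling here only stands in for the formal verifier.

```python
import random

def f(x):
    # Toy closed-loop dynamics (assumption, not from the paper): x' = -x + x^3.
    # The origin is stable and its true region of attraction is |x| < 1.
    return -x + x**3

def W(x):
    # Candidate Zubov function for the toy system.
    return x * x

def grad_W(x):
    return 2.0 * x

def psi(x):
    # Positive-definite weight chosen so that W solves Zubov's equation exactly.
    return 2.0 * x * x

def zubov_residual(x):
    # Residual of grad W * f + psi * (1 - W) = 0; zero means the equation holds.
    return grad_W(x) * f(x) + psi(x) * (1.0 - W(x))

def find_counterexamples(samples, level):
    # Sampled states inside {0 < W < level} where the Lyapunov decrease
    # condition grad W * f < 0 is violated.
    return [x for x in samples
            if 0.0 < W(x) < level and grad_W(x) * f(x) >= 0.0]

random.seed(0)
samples = [random.uniform(-1.5, 1.5) for _ in range(10_000)]

max_residual = max(abs(zubov_residual(x)) for x in samples)
cex_roa = find_counterexamples(samples, level=1.0)        # candidate set = true ROA
cex_oversized = find_counterexamples(samples, level=2.0)  # candidate set too large

print("max Zubov residual:", max_residual)
print("counterexamples in {W < 1}:", len(cex_roa))
print("counterexamples in {W < 2}:", len(cex_oversized))
```

In this sketch the sublevel set $\{W < 1\}$ produces no violations, while the oversized set $\{W < 2\}$ does, and every violation lies outside $|x| < 1$. A counterexample-guided loop would feed such violating states back into training; a sampling check like this can only find violations and cannot prove their absence, which is why a formal verifier is still needed.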
