Certified Training with Branch-and-Bound for Lyapunov-stable Neural Control
Zhouxing Shi, Haoyu Li, Cho-Jui Hsieh, Huan Zhang
TL;DR
This work introduces CT-BaB, a certified training framework that integrates verification-aware objectives into training for Lyapunov-stable neural controllers in discrete-time systems. By combining differentiable certified bounds from linear-relaxation verifiers with a training-time branch-and-bound (BaB) strategy, CT-BaB optimizes bounds over a region-of-interest instead of individual counterexamples, producing models with larger ROA and faster test-time verification. A dynamic training dataset of subregions is maintained and refined during training, and the resulting subregions can be exported to bootstrap test-time verification, achieving significant speedups (e.g., up to 164× larger ROA and 11–80× faster verification on challenging 2D quadrotor settings). This approach advances verification-friendly neural control by aligning training with verification, enabling robust, scalable guarantees within a global ROI while highlighting future work to scale to higher-dimensional systems and extend to other safety properties.
Abstract
We study the problem of learning verifiably Lyapunov-stable neural controllers that provably satisfy the Lyapunov asymptotic stability condition within a region-of-attraction (ROA). Unlike previous works that adopted counterexample-guided training without considering the computation of verification in training, we introduce Certified Training with Branch-and-Bound (CT-BaB), a new certified training framework that optimizes certified bounds, thereby reducing the discrepancy between training and test-time verification that also computes certified bounds. To achieve a relatively global guarantee on an entire input region-of-interest, we propose a training-time BaB technique that maintains a dynamic training dataset and adaptively splits hard input subregions into smaller ones, to tighten certified bounds and ease the training. Meanwhile, subregions created by the training-time BaB also inform test-time verification, for a more efficient training-aware verification. We demonstrate that CT-BaB yields verification-friendly models that can be more efficiently verified at test time while achieving stronger verifiable guarantees with larger ROA. On the largest output-feedback 2D Quadrotor system experimented, CT-BaB reduces verification time by over 11X relative to the previous state-of-the-art baseline while achieving 164X larger ROA.
