Scalable Surrogate Verification of Image-based Neural Network Control Systems using Composition and Unrolling
Feiyang Cai, Chuchu Fan, Stanley Bak
TL;DR
This work tackles safety verification for image-based NNCS by replacing the real perception with a cGAN-based surrogate and addressing two primary sources of overapproximation: one-step and multi-step errors. The authors propose two complementary improvements—composition of the discrete-time dynamics with the surrogate network to preserve state-output dependencies, and unrolling multiple control steps into a single verification target—to enable more accurate and scalable reachability analysis. Through two case studies, autonomous aircraft taxiing and advanced emergency braking, the approach substantially reduces overapproximation (e.g., a 175% reduction in the taxiing case) and enables verification where prior baselines fail, even with higher-dimensional perception outputs and modern architectures. The results demonstrate practical gains in safety certification for image-based NNCS while also highlighting remaining scalability challenges related to network size and higher-resolution perceptual inputs.
Abstract
Verifying safety of neural network control systems that use images as input is a difficult problem because, from a given system state, there is no known way to mathematically model what images are possible in the real-world. We build on recent work that considers a surrogate verification approach, training a conditional generative adversarial network (cGAN) as an image generator in place of the real world. This enables set-based formal analysis of the closed-loop system, providing analysis beyond simulation and testing. While existing work is effective on small examples, excessive overapproximation both within a single control period and across multiple control periods limits its scalability. We propose approaches to overcome these two sources of error. First, we overcome one-step error by composing the system's dynamics along with the cGAN and neural network controller, without losing the dependencies between input states and the control outputs as in the monotonic analysis of the system dynamics. Second, we reduce multi-step error by repeating the single-step composition, essentially unrolling multiple steps of the control loop into a large neural network. We then leverage existing network verification tools to compute accurate reachable sets for multiple steps, avoiding the accumulation of abstraction error at each step. We demonstrate the effectiveness of our approach in terms of both accuracy and scalability using two case studies: an autonomous aircraft taxiing system and an advanced emergency braking system. On the aircraft taxiing system, the converged reachable set is 175% larger using the prior baseline method compared with our proposed approach. On the emergency braking system, with 24x the number of image output variables from the cGAN, the baseline method fails to prove any states are safe, whereas our improvements enable set-based safety analysis.
