Training Verification-Friendly Neural Networks via Neuron Behavior Consistency

Zongxin Liu; Zhe Zhao; Fu Song; Jun Sun; Pengfei Yang; Xiaowei Huang; Lijun Zhang

Training Verification-Friendly Neural Networks via Neuron Behavior Consistency

Zongxin Liu, Zhe Zhao, Fu Song, Jun Sun, Pengfei Yang, Xiaowei Huang, Lijun Zhang

TL;DR

This work tackles the verification-time bottleneck of neural networks in safety-critical settings by introducing neuron behavior consistency (NBC) as a training regularizer. NBC encourages activation stability across inputs within a local neighborhood, reducing unstable neurons and tightening activation bounds to facilitate formal verification. Empirical results across MNIST, Fashion-MNIST, and CIFAR-10 show NBC-trained networks achieve faster verification, higher stable-neuron ratios, and greater UNSAT% while maintaining competitive robustness, and NBC can be effectively combined with existing training approaches to further boost verifiability. The method demonstrates practical impact by enabling verification-friendly networks for larger architectures without prohibitive training costs, with some trade-offs in accuracy that can be managed by tuning hyperparameters and combining with other verifiers or training schemes.

Abstract

Formal verification provides critical security assurances for neural networks, yet its practical application suffers from the long verification time. This work introduces a novel method for training verification-friendly neural networks, which are robust, easy to verify, and relatively accurate. Our method integrates neuron behavior consistency into the training process, making neuron activation states remain consistent across different inputs within a local neighborhood. This reduces the number of unstable neurons and tightens the bounds of neurons thereby enhancing the network's verifiability. We evaluated our method using the MNIST, Fashion-MNIST, and CIFAR-10 datasets with various network architectures. The experimental results demonstrate that networks trained using our method are verification-friendly across different radii and architectures, whereas other tools fail to maintain verifiability as the radius increases. Additionally, we show that our method can be combined with existing approaches to further improve the verifiability of networks.

Training Verification-Friendly Neural Networks via Neuron Behavior Consistency

TL;DR

Abstract

Training Verification-Friendly Neural Networks via Neuron Behavior Consistency

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (3)

Theorems & Definitions (1)