Safety-Critical Control with Offline-Online Neural Network Inference
Junhui Zhang, Sze Zheng Yong, Dimitra Panagou
TL;DR
This work addresses safe motion control for an ego agent operating among other agents with unknown dynamics. It jointly learns the other agents' dynamics offline with radial basis function neural networks and refines the model online using concurrent learning, eliminating the need for persistent excitation. Adaptive conformal prediction provides online, high-probability prediction sets for the learned dynamics, which are embedded into a sampled-data control barrier function framework to guarantee safety with high average confidence. The approach reduces conservatism compared to fixed bounds and demonstrates effective safety guarantees in a multi-agent simulation. The combination of offline-online learning, ACP uncertainty quantification, and CBF-based safety offers a practical path for real-time, safety-critical autonomy in dynamic environments.
Abstract
This paper presents a safety-critical control framework for an ego agent moving among other agents. The approach infers the dynamics of the other agents, and incorporates the inferred quantities into the design of control barrier function (CBF)-based controllers for the ego agent. The inference method combines offline and online learning with radial basis function neural networks (RBFNNs). The RBFNNs are initially trained offline using collected datasets. To enhance the generalization of the RBFNNs, the weights are then updated online with new observations, without requiring persistent excitation conditions in order to enhance the applicability of the method. Additionally, we employ adaptive conformal prediction to quantify the estimation error of the RBFNNs for the other agents' dynamics, generating prediction sets to cover the true value with high probability. Finally, we formulate a CBF-based controller for the ego agent to guarantee safety with the desired confidence level by accounting for the prediction sets of other agents' dynamics in the sampled-data CBF conditions. Simulation results are provided to demonstrate the effectiveness of the proposed method.
