Synthesizing Neural Network Controllers with Closed-Loop Dissipativity Guarantees
Neelay Junnarkar, Murat Arcak, Peter Seiler
TL;DR
The paper tackles safe neural network control for uncertain linear time-invariant plants by enforcing closed-loop dissipativity through IQCs. It derives a dissipativity certificate for uncertain LTI systems and transforms a BMI-based controller synthesis condition into an LMIs-based formulation via a variable change, enabling a projection-based RL training loop that preserves dissipativity. The approach is instantiated with implicit/recurrent neural networks and validated on a nonlinear inverted pendulum and a flexible rod on a cart, showing competitive performance while providing formal L2 gain and stability guarantees. This contributes a practically applicable framework for training NN controllers with rigorous safety and robustness guarantees in safety-critical settings.
Abstract
In this paper, a method is presented to synthesize neural network controllers such that the feedback system of plant and controller is dissipative, certifying performance requirements such as L2 gain bounds. The class of plants considered is that of linear time-invariant (LTI) systems interconnected with an uncertainty, including nonlinearities treated as an uncertainty for convenience of analysis. The uncertainty of the plant and the nonlinearities of the neural network are both described using integral quadratic constraints (IQCs). First, a dissipativity condition is derived for uncertain LTI systems. Second, this condition is used to construct a linear matrix inequality (LMI) which can be used to synthesize neural network controllers. Finally, this convex condition is used in a projection-based training method to synthesize neural network controllers with dissipativity guarantees. Numerical examples on an inverted pendulum and a flexible rod on a cart are provided to demonstrate the effectiveness of this approach.
