Data-Driven Dynamic Controller Synthesis for Discrete-Time General Nonlinear Systems
Behrad Samari, Abolfazl Lavaei
TL;DR
This work tackles the problem of safety controller synthesis for discrete-time general nonlinear systems with unknown dynamics and input constraints. It introduces a direct data-driven framework that uses an adding-one-integrator augmentation to form an augmented dt-GNS, enabling learning of augmented CBCs and dynamic safety controllers from a single input-state trajectory via data-dependent LMIs (and an alternative scenario-based program). The key contributions are the design of augmented CBCs (A-CBCs) from data, a quadratic form for the CBC, and both deterministic and probabilistic guarantees, demonstrated on challenging nonlinear and high-dimensional case studies. The approach yields finite-horizon safety certificates without requiring full model identification and avoids bilinearities in the optimization, offering scalable, runtime-applicable safety guarantees for unknown nonlinear systems under input constraints.
Abstract
Synthesizing safety controllers for general nonlinear systems is a highly challenging task, particularly when the system models are unknown, and input constraints are present. While some recent efforts have explored data-driven safety controller design for nonlinear systems, these approaches are primarily limited to specific classes of nonlinear dynamics (e.g., polynomials) and are not applicable to general nonlinear systems. This paper develops a direct data-driven approach for discrete-time general nonlinear systems, facilitating the simultaneous learning of control barrier certificates (CBCs) and dynamic controllers to ensure safety properties under input constraints. Specifically, by leveraging the adding-one-integrator approach, we incorporate the controller's dynamics into the system dynamics to synthesize a virtual static-feedback controller for the augmented system, resulting in a dynamic safety controller for the actual dynamics. We collect input-state data from the augmented system during a finite-time experiment, referred to as a single trajectory. Using this data, we learn augmented CBCs and the corresponding virtual safety controllers, ensuring the safety of the actual system and adherence to input constraints over a finite time horizon. We demonstrate that our proposed conditions boil down to some data-dependent linear matrix inequalities (LMIs), which are easy to satisfy. We showcase the effectiveness of our data-driven approach through two case studies: one exhibiting significant nonlinearity and the other featuring high dimensionality.
