Data driven synthesis of provable invariant sets via stochastically sampled data
Amy K. Strong, Ali Kashani, Claus Danielson, Leila Bridgeman
TL;DR
This work tackles safety-critical control by constructing provable PI sets for possibly unmodeled discrete-time systems using only pre-collected data and a known Lipschitz bound $L$. The authors develop a data-driven, geometric approach that over-approximates one-step evolution with balls around sampled states and leverages $\epsilon$-net set coverage to certify invariance, implemented via a tree-based partitioning algorithm. They prove that the method yields a PI set that approximates the maximal PI set and provide deterministic and probabilistic bounds linking the number of samples $M$ to the volume of the resulting invariant set. The approach requires no full system model, only data $\mathcal{D}=\{\mathbf{x}_j, \mathbf{x}_j^+\}$ and Lipschitz information, and is validated on unmodeled linear and nonlinear 2D systems, showing PI-set volumes grow with more data and correlate with model-based or barrier-function benchmarks. This provides a practical, provable pathway to data-driven safety guarantees in environments with unmodeled dynamics.
Abstract
Positive invariant (PI) sets are essential for ensuring safety, i.e. constraint adherence, of dynamical systems. With the increasing availability of sampled data from complex (and often unmodeled) systems, it is advantageous to leverage these data sets for PI set synthesis. This paper uses data driven geometric conditions of invariance to synthesize PI sets from data. Where previous data driven, set-based approaches to PI set synthesis used deterministic sampling schemes, this work instead synthesizes PI sets from any pre-collected data sets. Beyond a data set and Lipschitz continuity, no additional information about the system is needed. A tree data structure is used to partition the space and select samples used to construct the PI set, while Lipschitz continuity is used to provide deterministic guarantees of invariance. Finally, probabilistic bounds are given on the number of samples needed for the algorithm to determine of a certain volume.
