Temporal Logic Control for Nonlinear Stochastic Systems Under Unknown Disturbances
Ibon Gracia, Luca Laurenti, Manuel Mazo, Alessandro Abate, Morteza Lahijanian
TL;DR
This work tackles the synthesis of robust controllers for discrete-time nonlinear systems with unknown disturbance distributions under $\mathrm{LTL}_f$ specifications. It introduces a data-driven, abstraction-based framework that learns a high-confidence Uncertain MDP (UMDP) with convex polytope transition uncertainty via a two-layer state discretization and disturbance-support learning, enabling tighter guarantees than traditional IMDP approaches. A tailored robust dynamic programming algorithm operates on the product of the UMDP with the $\mathrm{LTL}_f$ automaton, yielding provable bounds on the satisfaction probability and scalable synthesis. Empirical results across multiple nonlinear and multi-dimensional systems show substantial improvements in sample efficiency, bound tightness, and computation time compared to the state of the art. The approach thus offers a practical pathway to reliable, data-driven control under unknown disturbances for complex dynamical systems.
Abstract
In this paper, we present a novel framework to synthesize robust strategies for discrete-time nonlinear systems with random disturbances that are unknown, against temporal logic specifications. The proposed framework is data-driven and abstraction-based: leveraging observations of the system, our approach learns a high-confidence abstraction of the system in the form of an uncertain Markov decision process (UMDP). The uncertainty in the resulting UMDP is used to formally account for both the error in abstracting the system and for the uncertainty coming from the data. Critically, we show that for any given state-action pair in the resulting UMDP, the uncertainty in the transition probabilities can be represented as a convex polytope obtained by a two-layer state discretization and concentration inequalities. This allows us to obtain tighter uncertainty estimates compared to existing approaches, and guarantees efficiency, as we tailor a synthesis algorithm exploiting the structure of this UMDP. We empirically validate our approach on several case studies, showing substantially improved performance compared to the state-of-the-art.
