Data-driven Interval MDP for Robust Control Synthesis
Rudi Coppola, Andrea Peruffo, Licio Romao, Alessandro Abate, Manuel Mazo
TL;DR
This work tackles data-driven robust control synthesis for discrete-time stochastic systems with unknown noise distribution by constructing finite-state abstractions that support reach-avoid specifications. It extends prior data-driven abstractions by incorporating nondeterminism in transitions through multiple-target sets, forming a robust MDP (RMDP) that is embedded into an interval MDP (iMDP) to leverage PAC guarantees via sampling. A key contribution is the use of scenario-based bounds to obtain PAC probability intervals for uncertain transitions, enabling valid lower bounds on satisfaction probabilities for the original system after policy refinement. Experimental results on a double-integrator and related models demonstrate that multi-target transitions provide richer controller synthesis at the cost of larger abstractions, with clear guidance on the tradeoffs and future directions for exploiting the structure of the uncertainty. Overall, the approach delivers data-driven, provably sound abstractions that enable reach-avoid control with enhanced flexibility and concrete PAC guarantees.
Abstract
The abstraction of dynamical systems is a powerful tool that enables the design of feedback controllers using a correct-by-design framework. We investigate a novel scheme to obtain data-driven abstractions of discrete-time stochastic processes in terms of richer discrete stochastic models, whose actions lead to nondeterministic transitions over the space of probability measures. The data-driven component of the proposed methodology lies in the fact that we only assume samples from an unknown probability distribution. We also rely on the model of the underlying dynamics to build our abstraction through backward reachability computations. The nondeterminism in the probability space is captured by a collection of Markov Processes, and we identify how this model can improve upon existing abstraction techniques in terms of satisfying temporal properties, such as safety or reach-avoid. The connection between the discrete and the underlying dynamics is made formal through the use of the scenario approach theory. Numerical experiments illustrate the advantages and main limitations of the proposed techniques with respect to existing approaches.
