Shield Synthesis for LTL Modulo Theories
Andoni Rodriguez, Guy Amir, Davide Corsi, Cesar Sanchez, Guy Katz
TL;DR
This work extends shielding for safety guarantees from propositional LTL to $LTL_{\,\mathcal{T}}$, enabling rich data and temporal reasoning in DRL-controlled systems. It combines Boolean abstraction with reactive synthesis modulo theories to produce shields, offering two main synthesis pathways: one via a Boolean controller augmented with a provider to yield a $C_{\mathcal{T}}$, and another via shields derived from winning regions to achieve maximal permissivity. The authors formalize minimal/feasible vs. maximally permissive shields, and introduce optimization techniques (e.g., soft constraints and bounded synthesis) to produce safe corrections that are close to the original potentially unsafe outputs. Empirical results demonstrate applicability to complex data with temporal dynamics and show efficiency gains over prior approaches, expanding shield applicability beyond Boolean specifications into expressive theories.
Abstract
In recent years, Machine Learning (ML) models have achieved remarkable success in various domains. However, these models also tend to demonstrate unsafe behaviors, precluding their deployment in safety-critical systems. To cope with this issue, ample research focuses on developing methods that guarantee the safe behaviour of a given ML model. A prominent example is shielding which incorporates an external component (a ``shield'') that blocks unwanted behavior. Despite significant progress, shielding suffers from a main setback: it is currently geared towards properties encoded solely in propositional logics (e.g., LTL) and is unsuitable for richer logics. This, in turn, limits the widespread applicability of shielding in many real-world systems. In this work, we address this gap, and extend shielding to LTL modulo theories, by building upon recent advances in reactive synthesis modulo theories. This allowed us to develop a novel approach for generating shields conforming to complex safety specifications in these more expressive, logics. We evaluated our shields and demonstrate their ability to handle rich data with temporal dynamics. To the best of our knowledge, this is the first approach for synthesizing shields for such expressivity.
