Synthesis of Safety Specifications for Probabilistic Systems
Gaspard Ohlmann, Edwin Hamel-De le Court, Francesco Belardinelli
TL;DR
The paper tackles synthesizing policies for safety specifications in probabilistic systems using Continuing PCTL (CPCTL), a fragment of safe PCTL that supports nested probabilistic operators. It introduces an augmented MDP that encodes global CPCTL satisfaction as local linear constraints and proves a coherence (state/path compatibility) theorem to connect local constraints to global satisfaction. It then presents CPCTL-VI, a value-iteration algorithm that computes realizable augmented states and corresponding policies, with soundness and optimality under a generalized Slater assumption. The authors demonstrate the approach on gridworld-like models, producing Pareto-frontier policies that balance safety and performance, and show CPCTL can express richer safety requirements than traditional MOR/MOA fragments with practical synthesis guarantees.
Abstract
Ensuring that agents satisfy safety specifications can be crucial in safety-critical environments. While methods exist for controller synthesis with safe temporal specifications, most existing methods restrict safe temporal specifications to probabilistic-avoidance constraints. Formal methods typically offer more expressive ways to express safety in probabilistic systems, such as Probabilistic Computation Tree Logic (PCTL) formulas. Thus, in this paper, we develop a new approach that supports more general temporal properties expressed in PCTL. Our contribution is twofold. First, we develop a theoretical framework for the Synthesis of safe-PCTL specifications. We show how the reducing global specification satisfaction to local constraints, and define CPCTL, a fragment of safe-PCTL. We demonstrate how the expressiveness of CPCTL makes it a relevant fragment for the Synthesis Problem. Second, we leverage these results and propose a new Value Iteration-based algorithm to solve the synthesis problem for these more general temporal properties, and we prove the soundness and completeness of our method.
