Stochastic Shortest Path Problem with Failure Probability
Ritsusamuel Otsubo
TL;DR
This work extends the stochastic shortest path framework to explicitly handle failure risk by introducing dead-ends and an allowable failure threshold $\epsilon$. It jointly optimizes a policy using a BAMDP perspective and a two-player zero-sum game, resulting in a $J_{c,\gamma,\epsilon}$ objective that blends the cost of successful episodes with a penalty for potential failures. The authors develop finite-approximation schemes (Case S and Case M) that enable practical computation via value iteration on bounded MDPs, with theoretical guarantees as $\gamma\to1$ and $M\to\infty$. They validate the approach on a motion-planning problem with obstacle avoidance, showing that expanding beyond conservative max-prob policies yields faster, less costly routes while keeping the failure probability within the prescribed bound. The framework thus offers a principled, scalable method for risk-aware sequential decision making in uncertain environments.
Abstract
We solve a sequential decision-making problem under uncertainty that takes the failure probability of a task into account. The stochastic shortest path problem, the standard model for sequential decision-making, cannot handle this setting on its own; it can be addressed by introducing dead-ends. Conventionally, only policies that minimize the probability of task failure are considered, so the resulting optimal policy can be overly conservative. In this paper, we address this issue by expanding the search to the class of policies whose failure probability is less than a desired threshold. We solve the resulting problem by formulating it as a Bayesian Markov decision process and as a two-person zero-sum game. We also show that the optimal policy can be expressed as a probability distribution over a set of deterministic policies. Finally, we demonstrate the effectiveness of the proposed methods by applying them to a motion planning problem with obstacle avoidance for a moving robot.
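To illustrate why a probability distribution over deterministic policies can help, the following is a minimal sketch and not the paper's algorithm. It assumes two hypothetical deterministic policies, each summarized by a (failure probability, expected cost) pair, and assumes that one of them is drawn at the start of an episode, so that both quantities are linear in the mixing weight. The function names and the numbers are illustrative assumptions only.

```python
def mix_weight(p_safe, p_fast, eps):
    """Largest probability of choosing the fast policy such that the
    mixture's failure probability does not exceed eps.

    Assumes p_safe <= p_fast (hypothetical inputs, not from the paper).
    """
    if p_safe > eps:
        raise ValueError("even the safest policy violates the threshold")
    if p_fast <= eps:
        return 1.0  # the fast policy is already admissible on its own
    # Solve (1 - w) * p_safe + w * p_fast = eps for w.
    return (eps - p_safe) / (p_fast - p_safe)


def mixture_stats(w_fast, safe, fast):
    """Failure probability and expected cost of the mixture.

    `safe` and `fast` are (failure_probability, expected_cost) tuples.
    """
    p = (1 - w_fast) * safe[0] + w_fast * fast[0]
    c = (1 - w_fast) * safe[1] + w_fast * fast[1]
    return p, c


# Hypothetical numbers: a conservative policy and a cheaper, riskier one.
safe = (0.01, 20.0)
fast = (0.11, 10.0)
eps = 0.05

w = mix_weight(safe[0], fast[0], eps)
p, c = mixture_stats(w, safe, fast)
print(f"weight on fast policy: {w:.3f}, failure prob: {p:.3f}, cost: {c:.2f}")
```

In this toy setting neither deterministic policy alone is a good answer: the conservative one is admissible but costly, and the cheap one exceeds the threshold. The mixture meets the threshold while lowering the expected cost, which is the intuition behind searching beyond failure-minimizing policies.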
