Widest Path Games and Maximality Inheritance in Bounded Value Iteration for Stochastic Games
Kittiphon Phalakarn, Yun Chen Tsai, Ichiro Hasuo
TL;DR
This paper identifies and formalizes the core principles underlying the widest path-based BVI approach by presenting 2WP-BVI, a clean BVI algorithm based on (2-player) widest path games, and proving its correctness using what is called the maximality inheritance principle.
Abstract
For model checking stochastic games (SGs), bounded value iteration (BVI) algorithms have gained attention as efficient approximate methods with rigorous precision guarantees. However, BVI may not terminate or converge when the target SG contains end components. Most existing approaches address this issue by explicitly detecting and processing end components--a process that is often computationally expensive. An exception is the widest path-based BVI approach previously studied by Phalakarn et al., which we refer to as 1WP-BVI. The method performs particularly well in the presence of numerous end components. Nonetheless, its theoretical foundations remain somewhat ad hoc. In this paper, we identify and formalize the core principles underlying the widest path-based BVI approach by (i) presenting 2WP-BVI, a clean BVI algorithm based on (2-player) widest path games, and (ii) proving its correctness using what we call the maximality inheritance principle--a proof principle previously employed in a well-known result in probabilistic model checking. Our experimental results demonstrate the practical relevance and potential of our proposed 2WP-BVI algorithm.
