Revelations: A Decidable Class of POMDPs with Omega-Regular Objectives

Marius Belly; Nathanaël Fijalkow; Hugo Gimbert; Florian Horn; Guillermo A. Pérez; Pierre Vandenhove

Revelations: A Decidable Class of POMDPs with Omega-Regular Objectives

Marius Belly, Nathanaël Fijalkow, Hugo Gimbert, Florian Horn, Guillermo A. Pérez, Pierre Vandenhove

TL;DR

This work addresses the decidability gap for almost-sure $\\omega$-regular (parity) objectives in POMDPs by introducing a revelation mechanism that enforces eventual full information. It defines weakly revealing and strongly revealing POMDPs and shows that, under these semantics, the problem reduces to analyzing a finite belief-support MDP, enabling exact algorithms. For weakly revealing POMDPs, parity objectives with priorities in $\{0,1,2\}$ are EXPTIME-complete, while $\{1,2,3\}$ is undecidable; strongly revealing POMDPs yield EXPTIME-complete decidability for parity, with two-player CoBüchi games remaining undecidable, and an optimistic (POMDP$_{sr}$) perspective broadens the practical relevance. An implementation sketch and comparisons to DRL highlight the practicality of the belief-support approach, illustrating a precise frontier between decidability and undecidability in POMDPs under revealing semantics.

Abstract

Partially observable Markov decision processes (POMDPs) form a prominent model for uncertainty in sequential decision making. We are interested in constructing algorithms with theoretical guarantees to determine whether the agent has a strategy ensuring a given specification with probability 1. This well-studied problem is known to be undecidable already for very simple omega-regular objectives, because of the difficulty of reasoning on uncertain events. We introduce a revelation mechanism which restricts information loss by requiring that almost surely the agent has eventually full information of the current state. Our main technical results are to construct exact algorithms for two classes of POMDPs called weakly and strongly revealing. Importantly, the decidable cases reduce to the analysis of a finite belief-support Markov decision process. This yields a conceptually simple and exact algorithm for a large class of POMDPs.

Revelations: A Decidable Class of POMDPs with Omega-Regular Objectives

TL;DR

This work addresses the decidability gap for almost-sure

-regular (parity) objectives in POMDPs by introducing a revelation mechanism that enforces eventual full information. It defines weakly revealing and strongly revealing POMDPs and shows that, under these semantics, the problem reduces to analyzing a finite belief-support MDP, enabling exact algorithms. For weakly revealing POMDPs, parity objectives with priorities in

are EXPTIME-complete, while

is undecidable; strongly revealing POMDPs yield EXPTIME-complete decidability for parity, with two-player CoBüchi games remaining undecidable, and an optimistic (POMDP

) perspective broadens the practical relevance. An implementation sketch and comparisons to DRL highlight the practicality of the belief-support approach, illustrating a precise frontier between decidability and undecidability in POMDPs under revealing semantics.

Revelations: A Decidable Class of POMDPs with Omega-Regular Objectives

TL;DR

Abstract

Revelations: A Decidable Class of POMDPs with Omega-Regular Objectives

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (10)

Theorems & Definitions (58)