Robust Markov Decision Processes: A Place Where AI and Formal Methods Meet

Marnix Suilen; Thom Badings; Eline M. Bovy; David Parker; Nils Jansen

Robust Markov Decision Processes: A Place Where AI and Formal Methods Meet

Marnix Suilen, Thom Badings, Eline M. Bovy, David Parker, Nils Jansen

TL;DR

Robust MDPs address the limitation of standard MDPs that require precise transition probabilities by incorporating an uncertainty set $\mathcal{P}$. The paper surveys the theory and practical methods for solving RMDPs, including robust value iteration and robust policy iteration, under $(s,a)$-rectangular and related assumptions. It maps RMDPs to related models such as parametric MDPs, stochastic games, and robust POMDPs, and discusses applications in reinforcement learning, abstraction, and tool support with Prism and Storm, while outlining open challenges. This provides a unified perspective at the intersection of AI and formal methods for robust sequential decision making.

Abstract

Markov decision processes (MDPs) are a standard model for sequential decision-making problems and are widely used across many scientific areas, including formal methods and artificial intelligence (AI). MDPs do, however, come with the restrictive assumption that the transition probabilities need to be precisely known. Robust MDPs (RMDPs) overcome this assumption by instead defining the transition probabilities to belong to some uncertainty set. We present a gentle survey on RMDPs, providing a tutorial covering their fundamentals. In particular, we discuss RMDP semantics and how to solve them by extending standard MDP methods such as value iteration and policy iteration. We also discuss how RMDPs relate to other models and how they are used in several contexts, including reinforcement learning and abstraction techniques. We conclude with some challenges for future work on RMDPs.

Robust Markov Decision Processes: A Place Where AI and Formal Methods Meet

TL;DR

Robust MDPs address the limitation of standard MDPs that require precise transition probabilities by incorporating an uncertainty set

. The paper surveys the theory and practical methods for solving RMDPs, including robust value iteration and robust policy iteration, under

-rectangular and related assumptions. It maps RMDPs to related models such as parametric MDPs, stochastic games, and robust POMDPs, and discusses applications in reinforcement learning, abstraction, and tool support with Prism and Storm, while outlining open challenges. This provides a unified perspective at the intersection of AI and formal methods for robust sequential decision making.

Robust Markov Decision Processes: A Place Where AI and Formal Methods Meet

TL;DR

Abstract

Robust Markov Decision Processes: A Place Where AI and Formal Methods Meet

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (1)

Theorems & Definitions (3)