Efficient Algorithms for Robust Markov Decision Processes with $s$-Rectangular Ambiguity Sets
Chin Pang Ho, Marek Petrik, Wolfram Wiesemann
TL;DR
The paper presents a unified framework for solving robust MDPs with $s$-rectangular ambiguity sets by reducing robust Bellman updates to structured projection subproblems. It provides exact efficient algorithms for weighted $1$- and $2$-norm ambiguity, and scalable approximate methods for common $\phi$-divergence sets (KL and Burg entropy), with clear complexity guarantees. The approach yields substantial practical speedups over state-of-the-art solvers across synthetic and benchmark MDPs, enabling scalable robust planning while preserving performance. The work integrates theoretical developments with comprehensive numerical experiments and demonstrates broad applicability to distributional robustness in dynamic programming contexts.
Abstract
Robust Markov decision processes (MDPs) have attracted significant interest due to their ability to protect MDPs from poor out-of-sample performance in the presence of ambiguity. In contrast to classical MDPs, which account for stochasticity by modeling the dynamics through a stochastic process with a known transition kernel, a robust MDP additionally accounts for ambiguity by optimizing against the most adverse transition kernel from an ambiguity set constructed via historical data. In this paper, we develop a unified solution framework for a broad class of robust MDPs with $s$-rectangular ambiguity sets, where the most adverse transition probabilities are considered independently for each state. Using our algorithms, we show that $s$-rectangular robust MDPs with $1$- and $2$-norm as well as $φ$-divergence ambiguity sets can be solved several orders of magnitude faster than with state-of-the-art commercial solvers, and often only a logarithmic factor slower than classical MDPs. We demonstrate the favorable scaling properties of our algorithms on a range of synthetically generated as well as standard benchmark instances.
