Prophet Inequalities for Bandits, Cabinets, and DAGs
Robin Bowers, Elias Lindgren, Bo Waggoner
TL;DR
This work addresses online selection among multiple costly Markov search processes (MSPs) on a finite acyclic graph, seeking a feasible subset that maximizes welfare. It develops a robust local-to-global approach using SAUP (single-agent utility problem) and threshold-based policies, leveraged through reductions to Pandora's Cabinets and bandit indices to handle highly interactive MSPs without relying on index theorems. The main contribution is a computationally efficient $\frac{1}{2}-\epsilon$ prophet inequality for Combinatorial Markov Search under any matroid constraint, together with a polynomial-time method to approximate the ex-ante optimum via convex optimization and an FPTAS. The results unify Bandits, Cabinets, and DAGs into a coherent framework, enabling incentive-compatible mechanisms with constant Price of Anarchy in settings where agents perform costly, strategic search, and extend classical prophet inequalities to broad interactive decision problems.
Abstract
A decisionmaker faces $n$ alternatives, each of which represents a potential reward. After investing costly resources into investigating the alternatives, the decisionmaker may select one, or more generally a feasible subset, and obtain the associated reward(s). The objective is to maximize the sum of rewards minus total costs invested. We consider this problem under a general model of an alternative as a "Markov Search Process," a type of undiscounted Markov Decision Process on a finite acyclic graph. Even simple cases generalize NP-hard problems such as Pandora's Box with nonobligatory inspection. Despite the apparently adaptive and interactive nature of the problem, we prove optimal prophet inequalities for this problem under a variety of combinatorial constraints. That is, we give approximation algorithms that interact with the alternatives sequentially, where each must be fully explored and either selected or else discarded before the next arrives. In particular, we obtain a computationally efficient $\frac{1}{2}-ε$ prophet inequality for Combinatorial Markov Search subject to any matroid constraint. This result implies incentive-compatible mechanisms with constant Price of Anarchy for serving single-parameter agents when the agents strategically conduct independent, costly search processes to discover their values.
