Optimal single threshold stopping rules and sharp prophet inequalities
Alexander Goldenshluger, Yaakov Malinovsky, Assaf Zeevi
TL;DR
This paper addresses the finite-horizon iid optimal stopping problem of selecting the maximum value and benchmarks performance against a prophet with perfect foresight through sharp, non-asymptotic prophet inequalities. It introduces randomized single-threshold stopping rules and a unifying game-theoretic framework that recasts the problem as two-person zero-sum games on the unit square, yielding precise constants via saddlepoint solutions. A discretization-based computational paradigm using finite matrix games computes sharp ratio-type and difference-type constants, and the framework accommodates restricted distribution families (e.g., bounded variance and Pareto-like tails) with concrete numerical results. The work advances both theory and computation, providing explicit formulas, minimax representations, and practical algorithms to derive optimal simple stopping rules with finite-horizon guarantees and broad applicability in online decision-making. The results illuminate the structure of optimal stopping under iid assumptions and offer distribution-sensitive performance bounds with direct implications for mechanism design and sequential decision problems.
Abstract
This paper considers a finite horizon optimal stopping problem for a sequence of independent and identically distributed random variables. The objective is to design stopping rules that attempt to select the random variable with the highest value in the sequence. The performance of any stopping rule may be benchmarked relative to the selection of a "prophet" that has perfect foreknowledge of the largest value. Such comparisons are typically stated in the form of "prophet inequalities." In this paper we characterize sharp prophet inequalities for single threshold stopping rules as solutions to infinite two person zero sum games on the unit square with special payoff kernels. The proposed game theoretic characterization allows one to derive sharp non-asymptotic prophet inequalities for different classes of distributions. This, in turn, gives rise to a simple and computationally tractable algorithmic paradigm for deriving optimal single threshold stopping rules.
