Optimal Guarantees for Online Selection Over Time

Sebastian Perez-Salazar; Victor Verdugo

Optimal Guarantees for Online Selection Over Time

Sebastian Perez-Salazar, Victor Verdugo

TL;DR

The paper advances the theory of prophet inequalities over time (POT) in the IID setting by deriving best-possible worst-case guarantees for both a single-threshold policy and the optimal dynamic programming policy. It develops a density-based analysis and a convex-optimization framework to obtain tight bounds for small numbers of thresholds (1, 2, and 3) and then characterizes the optimal policy for any finite horizon $n$ via a convex program, with a limit analysis yielding an asymptotic ratio near $0.618$. It also extends the discussion to adversarial and random-order models, proving a constant-factor guarantee of about $0.162$ in the random-order POT, and a lower-bound hardness under adversarial ordering. The methods connect threshold-based policies to infinite- and finite-dimensional convex programs, enabling exact computation of worst-case ratios and shedding light on the gap between simple, implementable strategies and the optimal offline benchmark. Overall, the work sharpens the understanding of the trade-off between commitment duration and opportunity capture, with implications for online selection and pricing mechanisms under time-constrained decisions.

Abstract

Prophet inequalities are a cornerstone in optimal stopping and online decision-making. Traditionally, they involve the sequential observation of $n$ non-negative independent random variables and face irrevocable accept-or-reject choices. The goal is to provide policies that provide a good approximation ratio against the optimal offline solution that can access all the values upfront -- the so-called prophet value. In the prophet inequality over time problem (POT), the decision-maker can commit to an accepted value for $τ$ units of time, during which no new values can be accepted. This creates a trade-off between the duration of commitment and the opportunity to capture potentially higher future values. In this work, we provide best possible worst-case approximation ratios in the IID setting of POT for single-threshold algorithms and the optimal dynamic programming policy. We show a single-threshold algorithm that achieves an approximation ratio of $(1+e^{-2})/2\approx 0.567$, and we prove that no single-threshold algorithm can surpass this guarantee. With our techniques, we can analyze simple algorithms using $k$ thresholds and show that with $k=3$ it is possible to get an approximation ratio larger than $\approx 0.602$. Then, for each $n$, we prove it is possible to compute the tight worst-case approximation ratio of the optimal dynamic programming policy for instances with $n$ values by solving a convex optimization program. A limit analysis of the first-order optimality conditions yields a nonlinear differential equation showing that the optimal dynamic programming policy's asymptotic worst-case approximation ratio is $\approx 0.618$. Finally, we extend the discussion to adversarial settings and show an optimal worst-case approximation ratio of $\approx 0.162$ when the values are streamed in random order.

Optimal Guarantees for Online Selection Over Time

TL;DR

via a convex program, with a limit analysis yielding an asymptotic ratio near

. It also extends the discussion to adversarial and random-order models, proving a constant-factor guarantee of about

in the random-order POT, and a lower-bound hardness under adversarial ordering. The methods connect threshold-based policies to infinite- and finite-dimensional convex programs, enabling exact computation of worst-case ratios and shedding light on the gap between simple, implementable strategies and the optimal offline benchmark. Overall, the work sharpens the understanding of the trade-off between commitment duration and opportunity capture, with implications for online selection and pricing mechanisms under time-constrained decisions.

Abstract

Prophet inequalities are a cornerstone in optimal stopping and online decision-making. Traditionally, they involve the sequential observation of

non-negative independent random variables and face irrevocable accept-or-reject choices. The goal is to provide policies that provide a good approximation ratio against the optimal offline solution that can access all the values upfront -- the so-called prophet value. In the prophet inequality over time problem (POT), the decision-maker can commit to an accepted value for

units of time, during which no new values can be accepted. This creates a trade-off between the duration of commitment and the opportunity to capture potentially higher future values. In this work, we provide best possible worst-case approximation ratios in the IID setting of POT for single-threshold algorithms and the optimal dynamic programming policy. We show a single-threshold algorithm that achieves an approximation ratio of

, and we prove that no single-threshold algorithm can surpass this guarantee. With our techniques, we can analyze simple algorithms using

thresholds and show that with

it is possible to get an approximation ratio larger than

. Then, for each

, we prove it is possible to compute the tight worst-case approximation ratio of the optimal dynamic programming policy for instances with

values by solving a convex optimization program. A limit analysis of the first-order optimality conditions yields a nonlinear differential equation showing that the optimal dynamic programming policy's asymptotic worst-case approximation ratio is

. Finally, we extend the discussion to adversarial settings and show an optimal worst-case approximation ratio of

when the values are streamed in random order.

Paper Structure (19 sections, 21 theorems, 141 equations, 1 figure, 1 table, 1 algorithm)

This paper contains 19 sections, 21 theorems, 141 equations, 1 figure, 1 table, 1 algorithm.

Introduction
Our Contribution and Results
Related Work
Improved Guarantees for Small Number of Thresholds
General Multiple Threshold Analysis
Optimal Analysis for Single-Threshold Algorithms
Analysis and Guarantees for Multiple Thresholds
Analysis for $k=2$ Thresholds.
Thresholds with Equidistant Intervals.
Tightness via Convex Optimization
Proof of Theorem \ref{['thm:apx-characterization']}
Step 1: An Infinite-Dimensional Optimization Problem.
Step 2: The Convex Optimization Problem.
Random Order Model
Proofs Deferred from Section \ref{['sec:small_thresholds']}
...and 4 more sections

Key Result

Proposition 1

Let $\pi$ be a policy that guarantees an approximation ratio of $\beta>0$ in the POT problem for all probability distributions $F$ that are strictly increasing and infinitely differentiable. Then, $\gamma \geq \beta$.

Figures (1)

Figure 1: Plot of $\bar{d}$ in the range $[0,2.2]$. We note that $\bar{d}$ is $0$ at $\lambda=0$ and $\lambda=2$.

Theorems & Definitions (57)

Proposition 1
Lemma 1: Key Lower Bound
proof
Proposition 2
Proposition 3
Lemma 2
proof : Proof of Lemma \ref{['lem:LB_for_1_threshold']}
Proposition 4
Proposition 5
Lemma 3
...and 47 more

Optimal Guarantees for Online Selection Over Time

TL;DR

Abstract

Optimal Guarantees for Online Selection Over Time

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (1)

Theorems & Definitions (57)