Prophet Inequalities with Cancellation Costs

Farbod Ekbatani; Rad Niazadeh; Pranav Nuti; Jan Vondrak

Prophet Inequalities with Cancellation Costs

Farbod Ekbatani, Rad Niazadeh, Pranav Nuti, Jan Vondrak

TL;DR

This work extends the prophet inequality framework to online decisions with cancellations and linear buyback costs. The authors derive the optimal online performance for all buyback factors $f\ge 0$ by solving a nonlinear differential equation linked to a continuum flow representation, showing the optimal competitive ratio is $\alpha(f)=\frac{1}{2-y_f(1)}$. They also design a simple, polynomial-time, order-agnostic randomized policy that attains this optimum, using a scale-invariant Poisson point process to simulate quantile acceptance and carefully calibrated thresholds. The analysis hinges on a chain of reductions to monotone two-point distributions, factor-revealing LPs, a generalized flow with leakage, and a differential-equation embedding; worst-case instances are constructed from the same $y_f$ solution to demonstrate tightness. Practically, cancellations improve online revenue management across cloud markets, ads, and hotel bookings, and the order-agnostic policy provides a robust, implementable approach that leverages distributional knowledge without requiring arrival order.

Abstract

Most of the literature on online algorithms in revenue management focuses on settings with irrevocable decisions, where once a decision is made upon the arrival of a new input, it cannot be canceled later. Motivated by modern applications -- such as cloud spot markets, selling banner ads, or online hotel booking -- we introduce and study "prophet inequalities with cancellations" under linear cancellation costs (known as the buyback model). In the classic prophet inequality problem, a sequence of independent random variables $X_1, X_2, \ldots$ with known distributions is revealed one by one, and a decision maker must decide when to stop and accept the current variable in order to maximize the expected value of their choice. In our model, after accepting $X_j$, one may later discard $X_j$ and accept another $X_i$ at a cost of $f \times X_j$, where $f\geq 0$ is a parameter. The goal is to maximize the expected net reward: the value of the final accepted variable minus the total cancellation cost. We aim to design online policies that are competitive against the optimal offline benchmark. Our first main result is an optimal prophet inequality for all parameters $f \ge 0$. We fully characterize the worst-case competitive ratio of the optimal online policy against the optimal offline via the solution to a certain differential equation (for which we provide a constructive solution). Our second main result is to design and analyze a simple and polynomial-time randomized adaptive policy that achieves this optimal competitive ratio. Importantly, our policy is order-agnostic (à la [Samuel-Cahn, 1984]), as it only needs the set of distributions and not their arrival order. These results are obtained by novel techniques related to factor-revealing LPs and generalized flow, reductions to a differential equation, and embedding of problem instances into specific Poisson point processes.

Prophet Inequalities with Cancellation Costs

TL;DR

This work extends the prophet inequality framework to online decisions with cancellations and linear buyback costs. The authors derive the optimal online performance for all buyback factors

by solving a nonlinear differential equation linked to a continuum flow representation, showing the optimal competitive ratio is

. They also design a simple, polynomial-time, order-agnostic randomized policy that attains this optimum, using a scale-invariant Poisson point process to simulate quantile acceptance and carefully calibrated thresholds. The analysis hinges on a chain of reductions to monotone two-point distributions, factor-revealing LPs, a generalized flow with leakage, and a differential-equation embedding; worst-case instances are constructed from the same

solution to demonstrate tightness. Practically, cancellations improve online revenue management across cloud markets, ads, and hotel bookings, and the order-agnostic policy provides a robust, implementable approach that leverages distributional knowledge without requiring arrival order.

Abstract

with known distributions is revealed one by one, and a decision maker must decide when to stop and accept the current variable in order to maximize the expected value of their choice. In our model, after accepting

, one may later discard

and accept another

at a cost of

, where

is a parameter. The goal is to maximize the expected net reward: the value of the final accepted variable minus the total cancellation cost. We aim to design online policies that are competitive against the optimal offline benchmark. Our first main result is an optimal prophet inequality for all parameters

. We fully characterize the worst-case competitive ratio of the optimal online policy against the optimal offline via the solution to a certain differential equation (for which we provide a constructive solution). Our second main result is to design and analyze a simple and polynomial-time randomized adaptive policy that achieves this optimal competitive ratio. Importantly, our policy is order-agnostic (à la [Samuel-Cahn, 1984]), as it only needs the set of distributions and not their arrival order. These results are obtained by novel techniques related to factor-revealing LPs and generalized flow, reductions to a differential equation, and embedding of problem instances into specific Poisson point processes.

Paper Structure (61 sections, 35 theorems, 123 equations, 5 figures, 3 algorithms)

This paper contains 61 sections, 35 theorems, 123 equations, 5 figures, 3 algorithms.

Introduction
Characterizing the Optimal Competitive Ratio
Reduction to a monotonic sequence of two-point distributions
Factor-revealing LPs: finite and infinite-dimensional formulations
Reduction to the differential equation
Interpretation of \ref{['integrals']}
Greedy-type algorithms with a monotonicity property
Properties of the functions $\varphi$ and $y$
Identifying the differential equation
Formal analysis
Solving the differential equation: proof sketch
The case of $f \geq 1$:
The case of $\frac{1}{3} \leq f < 1$:
The general case.
Worst-Case Instances
...and 46 more sections

Key Result

Proposition 1

If a competitive ratio $\alpha(f)$ can be achieved for any instance with random variables $X_i = v_i \cdot \mathrm{Ber}(q_i)$, where $\mathrm{Ber}(q_i)$ is a $0/1$-Bernoulli random variable with expectation $q_i$ and $0 < v_1 \leq v_2\leq \ldots \leq v_n$, then the same competitive ratio $\alpha(f)$

Figures (5)

Figure 1: Comparison of different competitive ratio bounds. The blue curve is the optimal competitive ratio (this paper), the red curve is the classic 0.5 bound samuel1984comparison and the black curves are the optimal bounds in the adversarial setting for deterministic policies babaioff2008selling and for general randomized policies ashwinkumar2009randomized.
Figure 2: Interpreting \ref{['eq:discrete-LP']} as a generalized flow; the red arrow indicates "lost" flow or the leakage (due to the buyback).
Figure :
Figure EC.1: Different possible outcomes of the construction, for different values of $\theta$
Figure EC.2: The results of the numerical experiments for different values of $f\in\{0.1,0.2,0.3,0.5,1,2,3,5,7,10\}$. The orange lines specify the median while the green triangles identify the mean performance ratio of the policy across 100 sampled instances.

Theorems & Definitions (39)

Proposition 1
Lemma 1
Lemma 2
Lemma 3
Lemma 4
Lemma 5
Proposition 2
Definition 1: Y-Function Differential Equation
Theorem 1: Existence of Proper Y-Function
Theorem 2: Optimal Competitive Ratio -- Lower bound
...and 29 more

Prophet Inequalities with Cancellation Costs

TL;DR

Abstract

Prophet Inequalities with Cancellation Costs

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (5)

Theorems & Definitions (39)