Dynamic Pricing with Adversarially-Censored Demands

Jianyu Xu; Yining Wang; Xi Chen; Yu-Xiang Wang

Dynamic Pricing with Adversarially-Censored Demands

Jianyu Xu, Yining Wang, Xi Chen, Yu-Xiang Wang

TL;DR

This work tackles online dynamic pricing with adversarially censored demand due to perishable inventories. It introduces the C20CB algorithm, which first performs pure exploration to estimate the linear demand parameters $a$ and $b$ from biased censored observations, then uses an optimistic, derivative-based pricing rule to learn the noise distribution and approach the time-varying optimal price. The authors prove a high-probability regret bound of $ ilde{O}( ext{sqrt}(T))$, matching information-theoretic lower bounds for related settings, and demonstrate the method’s robustness to adversarial inventory sequences. The proposed framework advances online decision-making under censored feedback and has potential extensions to contextual pricing, unbounded-noise scenarios, and non-linear demand, with significant implications for pricing strategies on perishable goods.

Abstract

We study an online dynamic pricing problem where the potential demand at each time period $t=1,2,\ldots, T$ is stochastic and dependent on the price. However, a perishable inventory is imposed at the beginning of each time $t$, censoring the potential demand if it exceeds the inventory level. To address this problem, we introduce a pricing algorithm based on the optimistic estimates of derivatives. We show that our algorithm achieves $\tilde{O}(\sqrt{T})$ optimal regret even with adversarial inventory series. Our findings advance the state-of-the-art in online decision-making problems with censored feedback, offering a theoretically optimal solution against adversarial observations.

Dynamic Pricing with Adversarially-Censored Demands

TL;DR

and

from biased censored observations, then uses an optimistic, derivative-based pricing rule to learn the noise distribution and approach the time-varying optimal price. The authors prove a high-probability regret bound of

, matching information-theoretic lower bounds for related settings, and demonstrate the method’s robustness to adversarial inventory sequences. The proposed framework advances online decision-making under censored feedback and has potential extensions to contextual pricing, unbounded-noise scenarios, and non-linear demand, with significant implications for pricing strategies on perishable goods.

Abstract

We study an online dynamic pricing problem where the potential demand at each time period

is stochastic and dependent on the price. However, a perishable inventory is imposed at the beginning of each time

, censoring the potential demand if it exceeds the inventory level. To address this problem, we introduce a pricing algorithm based on the optimistic estimates of derivatives. We show that our algorithm achieves

optimal regret even with adversarial inventory series. Our findings advance the state-of-the-art in online decision-making problems with censored feedback, offering a theoretically optimal solution against adversarial observations.

Dynamic Pricing with Adversarially-Censored Demands

TL;DR

Abstract

Dynamic Pricing with Adversarially-Censored Demands

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (1)

Theorems & Definitions (18)