Dynamic Pricing with Adversarially-Censored Demands
Jianyu Xu, Yining Wang, Xi Chen, Yu-Xiang Wang
TL;DR
This work tackles online dynamic pricing with adversarially censored demand due to perishable inventories. It introduces the C20CB algorithm, which first performs pure exploration to estimate the linear demand parameters $a$ and $b$ from biased censored observations, then uses an optimistic, derivative-based pricing rule to learn the noise distribution and approach the time-varying optimal price. The authors prove a high-probability regret bound of $ ilde{O}( ext{sqrt}(T))$, matching information-theoretic lower bounds for related settings, and demonstrate the method’s robustness to adversarial inventory sequences. The proposed framework advances online decision-making under censored feedback and has potential extensions to contextual pricing, unbounded-noise scenarios, and non-linear demand, with significant implications for pricing strategies on perishable goods.
Abstract
We study an online dynamic pricing problem where the potential demand at each time period $t=1,2,\ldots, T$ is stochastic and dependent on the price. However, a perishable inventory is imposed at the beginning of each time $t$, censoring the potential demand if it exceeds the inventory level. To address this problem, we introduce a pricing algorithm based on the optimistic estimates of derivatives. We show that our algorithm achieves $\tilde{O}(\sqrt{T})$ optimal regret even with adversarial inventory series. Our findings advance the state-of-the-art in online decision-making problems with censored feedback, offering a theoretically optimal solution against adversarial observations.
