The Data-Driven Censored Newsvendor Problem
Chamsi Hssaine, Sean R. Sinclair
TL;DR
This work introduces a data-driven censored newsvendor framework and uses distributionally robust optimization to quantify how censored historical data affects ordering decisions. A precise identifiability criterion, stated in terms of the observable boundary λ (the largest historical order quantity) and the critical ratio ρ, yields a sharp dichotomy: if G^−(λ) ≥ ρ, vanishing minimax regret is achievable and the minimax-optimal order q^Δ coincides with q^*_G; otherwise, the problem is unidentifiable and the information loss Δ is strictly positive. The authors derive closed-form expressions for Δ and q^Δ, and propose Robust Censored Newsvendor (RCN), a two-stage algorithm that adapts to the censoring level and achieves near-optimal regret with finite-sample guarantees across all regimes (Regret ≤ Δ + Õ(1/√N) with N samples). They also prove lower bounds that match these guarantees up to polylogarithmic factors. Extensive experiments on synthetic and real-world datasets confirm robust performance across censoring regimes. The framework offers practical guidance for inventory decisions under censored data and lays groundwork for extensions to contextual and multi-period settings.
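To make the identifiability criterion concrete, the following minimal sketch checks an empirical version of G^−(λ) ≥ ρ on censored sales data. It is not the authors' RCN algorithm and rests on several assumptions that are not in the source: every historical order equals the boundary λ, so each sale is min(demand, λ); ρ is the textbook critical ratio b/(b+h) for underage cost b and overage cost h; G^−(λ) is read as the probability that demand is strictly below λ; and all function names are hypothetical.

```python
import math
import numpy as np


def critical_ratio(underage_cost, overage_cost):
    """Textbook newsvendor critical ratio b / (b + h) (assumed definition of rho)."""
    return underage_cost / (underage_cost + overage_cost)


def identifiability_check(sales, boundary, rho):
    """Empirical analogue of the criterion G^-(lambda) >= rho.

    Assumes every historical order equals `boundary`, so a sale strictly
    below the boundary is an uncensored demand observation.
    Returns (is_identifiable, estimated_mass_below_boundary).
    """
    sales = np.asarray(sales, dtype=float)
    mass_below = float(np.mean(sales < boundary))
    return mass_below >= rho, mass_below


def plug_in_order(sales, boundary, rho):
    """Empirical rho-quantile of sales when the check passes, else None.

    When at least a rho fraction of sales lies strictly below the boundary,
    the rho-quantile of sales is itself below the boundary, so censoring
    does not distort it. The unidentifiable case is deliberately left
    unhandled: the paper's closed form for q^Delta is not reproduced here.
    """
    ok, _ = identifiability_check(sales, boundary, rho)
    if not ok:
        return None
    ordered = np.sort(np.asarray(sales, dtype=float))
    # Smallest order statistic whose empirical CDF value is at least rho.
    k = max(math.ceil(rho * len(ordered)) - 1, 0)
    return float(ordered[k])


if __name__ == "__main__":
    demand = np.arange(1, 11)            # hypothetical demand draws 1..10
    rho = critical_ratio(1.0, 1.0)       # equal costs give rho = 0.5
    for lam in (8, 4):                   # light vs. heavy censoring
        sales = np.minimum(demand, lam)  # only censored sales are observed
        print(lam, identifiability_check(sales, lam, rho),
              plug_in_order(sales, lam, rho))
```

With the larger boundary, most of the demand mass is observed and the plug-in quantile is unaffected by censoring. With the smaller boundary the check fails, and the data alone cannot pin down the critical quantile; this is the regime in which the information loss Δ is strictly positive.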
Abstract
We study a censored variant of the data-driven newsvendor problem, where the decision-maker must select an ordering quantity that minimizes expected overage and underage costs based only on offline censored sales data, rather than historical demand realizations. Our goal is to understand how the degree of historical demand censoring affects the performance of any learning algorithm for this problem. To isolate this impact, we adopt a distributionally robust optimization framework, evaluating policies according to their worst-case regret over an ambiguity set of distributions. This set is defined by the largest historical order quantity (the observable boundary of the dataset), and contains all distributions matching the true demand distribution up to this boundary, while allowing them to be arbitrary afterwards. We demonstrate a spectrum of achievability under demand censoring by deriving a natural necessary and sufficient condition under which vanishing regret is an achievable goal. In regimes in which it is not, we exactly characterize the information loss due to censoring: an insurmountable lower bound on the performance of any policy, even when the decision-maker has access to infinitely many demand samples. We then leverage these sharp characterizations to propose a natural robust algorithm that adapts to the historical level of demand censoring. We derive finite-sample guarantees for this algorithm across all possible censoring regimes and show its near-optimality with matching lower bounds (up to polylogarithmic factors). We moreover demonstrate its robust performance via extensive numerical experiments on both synthetic and real-world datasets.
