The Nonstationary Newsvendor with (and without) Predictions

Lin An; Andrew A. Li; Benjamin Moseley; R. Ravi

The Nonstationary Newsvendor with (and without) Predictions

Lin An, Andrew A. Li, Benjamin Moseley, R. Ravi

TL;DR

This work tackles sequential inventory decisions under unknown, nonstationary demand through a formal Nonstationary Newsvendor model with a variation parameter $v\in[0,1]$ and optional generic predictions with accuracy exponent $a$. It delivers a complete theoretical treatment for the no-prediction setting, proving a minimax lower bound $\Omega(T^{(3+v)/4})$ and an algorithm with matching upper bound $\tilde{O}(T^{(3+v)/4})$, even when $v$ is unknown, by using fixed-time or shrinking windows. It then extends to a prediction-augmented setting, introducing the Prediction-Error-Robust Policy (PERP) that robustly leverages predictions without knowing their accuracy and achieves $\tilde{O}(T^{\min\{(3+v)/4, a\}})$ regret when $v$ is known, with lower bounds showing limits when both $v$ and $a$ are unknown. Empirical results on three real datasets (Wikipedia, Rossmann, and a Japanese restaurant) demonstrate that PERP closes a substantial portion of the gap between prediction-free and prediction-based baselines, highlighting practical value in dynamic, uncertain environments. Overall, the paper advances online inventory optimization under nonstationarity by jointly optimizing for unknown demand evolution and potentially noisy predictions, yielding provable regret guarantees and robust real-world performance.

Abstract

The classic newsvendor model yields an optimal decision for a ``newsvendor'' selecting a quantity of inventory, under the assumption that the demand is drawn from a known distribution. Motivated by applications such as cloud provisioning and staffing, we consider a setting in which newsvendor-type decisions must be made sequentially, in the face of demand drawn from a stochastic process that is both unknown and nonstationary. All prior work on this problem either (a) assumes that the level of nonstationarity is known, or (b) imposes additional statistical assumptions that enable accurate predictions of the unknown demand. Our research tackles the Nonstationary Newsvendor without these assumptions, both with and without predictions. We first, in the setting without predictions, design a policy which we prove achieves order-optimal regret -- ours is the first policy to accomplish this without being given the level of nonstationarity of the underlying demand. We then, for the first time, introduce a model for generic (i.e. with no statistical assumptions) predictions with arbitrary accuracy, and propose a policy that incorporates these predictions without being given their accuracy. We upper bound the regret of this policy, and show that it matches the best achievable regret had the accuracy of the predictions been known. Our findings provide valuable insights on inventory management. Managers can make more informed and effective decisions in dynamic environments, reducing costs and enhancing service levels despite uncertain demand patterns. We empirically validate our new policy with experiments based on three real-world datasets containing thousands of time-series, showing that it succeeds in closing approximately 74% of the gap between the best approaches based on nonstationarity and predictions alone.

The Nonstationary Newsvendor with (and without) Predictions

TL;DR

This work tackles sequential inventory decisions under unknown, nonstationary demand through a formal Nonstationary Newsvendor model with a variation parameter

and optional generic predictions with accuracy exponent

. It delivers a complete theoretical treatment for the no-prediction setting, proving a minimax lower bound

and an algorithm with matching upper bound

, even when

is unknown, by using fixed-time or shrinking windows. It then extends to a prediction-augmented setting, introducing the Prediction-Error-Robust Policy (PERP) that robustly leverages predictions without knowing their accuracy and achieves

regret when

is known, with lower bounds showing limits when both

and

are unknown. Empirical results on three real datasets (Wikipedia, Rossmann, and a Japanese restaurant) demonstrate that PERP closes a substantial portion of the gap between prediction-free and prediction-based baselines, highlighting practical value in dynamic, uncertain environments. Overall, the paper advances online inventory optimization under nonstationarity by jointly optimizing for unknown demand evolution and potentially noisy predictions, yielding provable regret guarantees and robust real-world performance.

Abstract

Paper Structure (38 sections, 21 theorems, 88 equations, 6 figures, 4 tables, 6 algorithms)

This paper contains 38 sections, 21 theorems, 88 equations, 6 figures, 4 tables, 6 algorithms.

Introduction
The Nonstationary Newsvendor, with and without Predictions
Our Contributions
1. Nonstationary Newsvendor (without predictions):
2. Nonstationary Newsvendor with Predictions:
3. Empirical Results:
Literature Review
Model: The Nonstationary Newsvendor (without Predictions)
Demand Distributions
Aside: Comparison to keskin2021nonstationary:
Demand Variation
Aside: Time-Series Modeling:
Performance Metric: Regret
Solution to the Nonstationary Newsvendor (without Predictions)
Upper Bound with Known Variation Parameter $v$
...and 23 more sections

Key Result

Theorem 1

There exists a policy which achieves $\tilde{O}(T^{(3+v)/4})$ regretThe $\tilde{O}(\cdot)$ notation hides logarithmic factors. without knowing $v$.

Figures (6)

Figure 1: Daily number of customers (in blue), from September 2014 to January 2015, at two different stores in the Rossmann drug store chain. Predictions (in red), starting November 2014, are generated using Exponential Smoothing with the same fitting process. The store in the upper sub-figure has substantially more accurate predictions ($R^2=0.88$) than that of the lower sub-figure ($R^2=0.11$).
Figure 2: The costs of NO-PRED, PURE-PRED, PERP, and OPT when (a) the variation parameter $v$ is fixed, and (b) the accuracy parameter $a$ is fixed. Each dot represents the cost of the corresponding policy on a given instance.
Figure 3: An example of a single time series from each dataset. In (c), the red dashed line represents an additional fetaure: daily online reservations.
Figure 4: Histograms of GAPs across (a) 1,000 randomly-sampled instances on the Rossmann dataset, (b) 2,880 instances on the Wikipedia dataset, and (c) 740 instances on the Restaurant dataset.
Figure 5: Histograms of the GAPs for (a) 173 Rossmann instances (left), 522 Wikipedia instances (middle), 476 Restaurant instances (right) for which PURE-PRED has lower cost, and (b) 827 Rossmann instances (left), 2358 Wikipedia instances (middle), 264 Restaurant instances (right) for which NO-PRED has lower cost.
...and 1 more figures

Theorems & Definitions (39)

Theorem 1: Informal
Proposition 1: Informal
Proposition 2: Informal
Theorem 2: Informal
Proposition 3: Informal
Lemma 1
Proposition 1: Lower Bound: Nonstationary Newsvendor
Lemma 2: Upper Bound: Nonstationary Newsvendor with Known $v$
Theorem 1: Upper Bound: Nonstationary Newsvendor with Unknown $v$
proof : Proof Sketch of Theorem \ref{['upper bound on regret: past-demand-only with unknown variation']}
...and 29 more

The Nonstationary Newsvendor with (and without) Predictions

TL;DR

Abstract

The Nonstationary Newsvendor with (and without) Predictions

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (6)

Theorems & Definitions (39)