Learning Decisions Offline from Censored Observations with ε-insensitive Operational Costs

Minxia Chen; Ke Fu; Teng Huang; Miao Bai

Learning Decisions Offline from Censored Observations with ε-insensitive Operational Costs

Minxia Chen, Ke Fu, Teng Huang, Miao Bai

TL;DR

The paper tackles offline, data-driven decision-making when historical observations are censored and censoring indicators are unavailable. It introduces $\varepsilon$-insensitive operational costs to guide one-step learning without distributional assumptions. It develops three algorithms—LR-$\varepsilon$NVC, LR-$\varepsilon$NVC-R, and NN-$\varepsilon$NVC—with corresponding stability and generalization guarantees, and demonstrates superior out-of-sample performance on a multi-feature newsvendor problem, achieving cost savings up to $14.40\%$ and $12.21\%$ compared to two strong baselines. The results highlight the approach's practical value for learning decisions from censored offline data in operations management.

Abstract

Many important managerial decisions are made based on censored observations. Making decisions without adequately handling the censoring leads to inferior outcomes. We investigate the data-driven decision-making problem with an offline dataset containing the feature data and the censored historical data of the variable of interest without the censoring indicators. Without assuming the underlying distribution, we design and leverage ε-insensitive operational costs to deal with the unobserved censoring in an offline data-driven fashion. We demonstrate the customization of the ε-insensitive operational costs for a newsvendor problem and use such costs to train two representative ML models, including linear regression (LR) models and neural networks (NNs). We derive tight generalization bounds for the custom LR model without regularization (LR-εNVC) and with regularization (LR-εNVC-R), and a high-probability generalization bound for the custom NN (NN-εNVC) trained by stochastic gradient descent. The theoretical results reveal the stability and learnability of LR-εNVC, LR-εNVC-R and NN-εNVC. We conduct extensive numerical experiments to compare LR-εNVC-R and NN-εNVC with two existing approaches, estimate-as-solution (EAS) and integrated estimation and optimization (IEO). The results show that LR-εNVC-R and NN-εNVC outperform both EAS and IEO, with maximum cost savings up to 14.40% and 12.21% compared to the lowest cost generated by the two existing approaches. In addition, LR-εNVC-R's and NN-εNVC's order quantities are statistically significantly closer to the optimal solutions should the underlying distribution be known.

Learning Decisions Offline from Censored Observations with ε-insensitive Operational Costs

TL;DR

The paper tackles offline, data-driven decision-making when historical observations are censored and censoring indicators are unavailable. It introduces

-insensitive operational costs to guide one-step learning without distributional assumptions. It develops three algorithms—LR-

NVC, LR-

NVC-R, and NN-

NVC—with corresponding stability and generalization guarantees, and demonstrates superior out-of-sample performance on a multi-feature newsvendor problem, achieving cost savings up to

and

compared to two strong baselines. The results highlight the approach's practical value for learning decisions from censored offline data in operations management.

Abstract

Paper Structure (34 sections, 10 theorems, 53 equations, 5 figures, 11 tables, 1 algorithm)

This paper contains 34 sections, 10 theorems, 53 equations, 5 figures, 11 tables, 1 algorithm.

Introduction
Decision-Making with Unobserved Censoring
Newsvendor with Sales Data.
Custom Pricing.
Preventive Replacement.
Model Overview
Summary of Results and Contributions
Literature Review
Data-Driven Decision-Making under Uncertainty
Censored Observations in Business Decision-Making
$\varepsilon$-insensitive Loss Functions
Offline Decision-Learning Algorithms with Censored Observations
Custom $\varepsilon$-insensitive Operational Costs
Newsvendor Ordering.
Custom Pricing.
...and 19 more sections

Key Result

Proposition 1

The gradient with respect to the parameters $\theta^j$ in the LR model for the loss function $\mathcal{L}^{\varepsilon NV}\xspace_i(\boldsymbol{\theta}|(\boldsymbol{x}_i,s_i))$ is

Figures (5)

Figure 1: A Simple Neural Network
Figure 2: Out-of-Sample Average Newsvendor Costs
Figure 3: Mean Percentage Cost Savings of LR-$\varepsilon$NVC-R (NN-$\varepsilon$NVC) over LR-NVC (NN-NVC)
Figure 4: Out-of-Sample $\text{RMSE}^{\text{Q}}$
Figure C.1: QQ Plot of Residual Error for Store 10 with Normal Distribution $\mathcal{N}(0,46.57^2)$

Theorems & Definitions (11)

Proposition 1: Gradients of $\mathcal{L}^{\varepsilon NV}\xspace$ in LR-$\varepsilon$NVC
Proposition 2: Gradients of $\mathcal{L}^{\varepsilon NV}\xspace$ in NN-$\varepsilon$NVC
Lemma 1: Tight uniform bound on $\mathcal{L}^{\varepsilon NV}\xspace$
Lemma 2
Proposition 3: Uniform stability of LR-$\varepsilon$NVC
Theorem 1: Generalization bound for LR-$\varepsilon$NVC
Proposition 4: Uniform stability of LR-$\varepsilon$NVC-R
Theorem 2: Generalization bound for LR-$\varepsilon$NVC-R
Proposition 5: Bound on UAS for NN-$\varepsilon$NVC (SGD)
Theorem 3: Generalization bound for NN-$\varepsilon$NVC (SGD)
...and 1 more

Learning Decisions Offline from Censored Observations with ε-insensitive Operational Costs

TL;DR

Abstract

Learning Decisions Offline from Censored Observations with ε-insensitive Operational Costs

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (5)

Theorems & Definitions (11)