Table of Contents
Fetching ...

From Data to Uncertainty Sets: a Machine Learning Approach

Dimitris Bertsimas, Benjamin Boucher

TL;DR

This paper addresses the challenge of enforcing constraints when model-predicted parameters are uncertain. It introduces a loss-based uncertainty set that adapts to covariates via a trained predictor, and establishes probabilistic guarantees for constraint satisfaction. The approach generalizes ellipsoidal uncertainty sets, remains tractable under concave constraint functions, and offers stronger guarantees, including tailored bounds for mean-squared-error based regression with potential variance prediction. Empirical results on Newsvendor, Portfolio Optimization, and Shortest Path demonstrate substantially smaller uncertainty radii and improved objective values and regret relative to baselines. The method provides a practical, theoretically grounded way to incorporate ML uncertainty into prescriptive optimization with covariate information.

Abstract

Existing approaches of prescriptive analytics -- where inputs of an optimization model can be predicted by leveraging covariates in a machine learning model -- often attempt to optimize the mean value of an uncertain objective. However, when applied to uncertain constraints, these methods rarely work because satisfying a crucial constraint in expectation may result in a high probability of violation. To remedy this, we leverage robust optimization to protect a constraint against the uncertainty of a machine learning model's output. To do so, we design an uncertainty set based on the model's loss function. Intuitively, this approach attempts to minimize the uncertainty around a prediction. Extending guarantees from the robust optimization literature, we derive strong guarantees on the probability of violation. On synthetic computational experiments, our method requires uncertainty sets with radii up to one order of magnitude smaller than those of other approaches.

From Data to Uncertainty Sets: a Machine Learning Approach

TL;DR

This paper addresses the challenge of enforcing constraints when model-predicted parameters are uncertain. It introduces a loss-based uncertainty set that adapts to covariates via a trained predictor, and establishes probabilistic guarantees for constraint satisfaction. The approach generalizes ellipsoidal uncertainty sets, remains tractable under concave constraint functions, and offers stronger guarantees, including tailored bounds for mean-squared-error based regression with potential variance prediction. Empirical results on Newsvendor, Portfolio Optimization, and Shortest Path demonstrate substantially smaller uncertainty radii and improved objective values and regret relative to baselines. The method provides a practical, theoretically grounded way to incorporate ML uncertainty into prescriptive optimization with covariate information.

Abstract

Existing approaches of prescriptive analytics -- where inputs of an optimization model can be predicted by leveraging covariates in a machine learning model -- often attempt to optimize the mean value of an uncertain objective. However, when applied to uncertain constraints, these methods rarely work because satisfying a crucial constraint in expectation may result in a high probability of violation. To remedy this, we leverage robust optimization to protect a constraint against the uncertainty of a machine learning model's output. To do so, we design an uncertainty set based on the model's loss function. Intuitively, this approach attempts to minimize the uncertainty around a prediction. Extending guarantees from the robust optimization literature, we derive strong guarantees on the probability of violation. On synthetic computational experiments, our method requires uncertainty sets with radii up to one order of magnitude smaller than those of other approaches.

Paper Structure

This paper contains 16 sections, 5 theorems, 39 equations, 4 figures, 7 tables.

Key Result

Theorem 2.1

If $\forall j \in [m], \, \mathbb{E} \left[ \ell_{j} \left( \Tilde{y}^{(j)}, f_{j}^{*} \left( \Tilde{ \bm{X} } \right) \right) \right] > 0$, the minimizer of $\left( f, \bm{\sigma} \right) \mapsto \mathbb{E} \left[ \ell' \left( \Tilde{\bm{y}}, f \left( \Tilde{\bm{X}} \right), \bm{\sigma} \right) \ri

Figures (4)

  • Figure 1: Objective value for the portfolio optimization problem for various $N$ and $\Bar{\varepsilon}$.
  • Figure 2: Regret for the portfolio optimization problem for various $N$ and $\Bar{\varepsilon}$.
  • Figure 3: Objective value for the shortest path problem for various $\text{deg}$ and $\Bar{\varepsilon}$.
  • Figure 4: Regret for the shortest path problem for various $\text{deg}$ and $\Bar{\varepsilon}$.

Theorems & Definitions (10)

  • Theorem 2.1
  • proof
  • Proposition 1
  • proof
  • Proposition 2
  • proof
  • Theorem 3.1
  • proof
  • Theorem 3.2
  • proof