Active Learning For Contextual Linear Optimization: A Margin-Based Approach

Mo Liu; Paul Grigas; Heyuan Liu; Zuo-Jun Max Shen

Active Learning For Contextual Linear Optimization: A Margin-Based Approach

Mo Liu, Paul Grigas, Heyuan Liu, Zuo-Jun Max Shen

TL;DR

This work introduces MBAL-SPO, the first active-learning framework tailored for contextual linear optimization where the objective coefficients are unknown and must be inferred from features. By leveraging the SPO loss and its SPO+ surrogate, the method uses a margin-based distance to degeneracy to decide when to acquire labels, achieving lower label complexity than fully supervised approaches. The authors provide non-asymptotic excess-risk bounds for both SPO and surrogate losses, along with detailed analyses for hard and soft rejection variants and separable SPO+ scenarios, under natural margin conditions. Empirical studies on shortest path and personalized pricing demonstrate that MBAL-SPO achieves substantially lower SPO risk with fewer labeled samples, validating its practical value in data-driven decision making. The work advances prescriptive analytics by integrating margin-based active learning with decision-focused learning, offering scalable guarantees and concrete guidance for label acquisition in cost-sensitive optimization tasks.

Abstract

We develop the first active learning method for contextual linear optimization. Specifically, we introduce a label acquisition algorithm that sequentially decides whether to request the ``labels'' of feature samples from an unlabeled data stream, where the labels correspond to the coefficients of the objective in the linear optimization. Our method is the first to be directly informed by the decision loss induced by the predicted coefficients, referred to as the Smart Predict-then-Optimize (SPO) loss. Motivated by the structure of the SPO loss, our algorithm adopts a margin-based criterion utilizing the concept of distance to degeneracy. In particular, we design an efficient active learning algorithm with theoretical excess risk (i.e., generalization) guarantees. We derive upper bounds on the label complexity, defined as the number of samples whose labels are acquired to achieve a desired small level of SPO risk. These bounds show that our algorithm has a much smaller label complexity than the naive supervised learning approach that labels all samples, particularly when the SPO loss is minimized directly on the collected data. To address the discontinuity and nonconvexity of the SPO loss, we derive label complexity bounds under tractable surrogate loss functions. Under natural margin conditions, these bounds also outperform naive supervised learning. Using the SPO+ loss, a specialized surrogate of the SPO loss, we establish even tighter bounds under separability conditions. Finally, we present numerical evidence showing the practical value of our algorithms in settings such as personalized pricing and the shortest path problem.

Active Learning For Contextual Linear Optimization: A Margin-Based Approach

TL;DR

Abstract

Paper Structure (44 sections, 17 theorems, 94 equations, 9 figures, 1 table, 2 algorithms)

This paper contains 44 sections, 17 theorems, 94 equations, 9 figures, 1 table, 2 algorithms.

Introduction
Motivating Example and Literature Review
Example Application: Personalized Pricing
Literature Review
Active learning.
Predict-then-optimize framework.
Organization
Preliminaries
Contextual Linear Optimization (CLO) and Active Learning
Surrogate Loss Functions and SPO+
Margin-Based Algorithm
Illustration and Motivation
MBAL-SPO Algorithm
Guarantees and Analysis for the Margin-Based Algorithm
MBAL-SPO under SPO Loss
...and 29 more sections

Key Result

Lemma 1

Given two cost vectors $c_1, c_2 \in \mathbb{R}^d$, if $\| c_1- c_2 \| < \max\{\nu_S(c_1), \nu_S(c_2)\}$, then it holds that $w^*(c_1) = w^*(c_2)$. In other words, the optimal decisions for $c_1$ and $c_2$ are the same.

Figures (9)

Figure 1: Illustration for how active learning reduces the label complexity, given the same prediction.
Figure 2: Illustration for how the SPO loss function reduces the label complexity, given the same size confidence region.
Figure 3: Risk on the test set during the training process in $3 \times 3$ grid, and $5 \times 5$ grid.
Figure 4: Excess test set risk during the training process in personalized pricing.
Figure 5: Performance under different initial quantile values in MBAL-SPO
...and 4 more figures

Theorems & Definitions (44)

Example 1: Personalized pricing via customer surveys
Definition 1
Lemma 1: Conditions for identical decisions
Definition 2: Near-degeneracy function
Lemma 2: Upper bound on the expected number of acquired labels
Theorem 1: SPO surrogate loss, hard rejection
Example 2: Example of distribution that satisfies Assumption \ref{['assumption:1']} for SPO loss
Definition 3: Sequential covering number
Proposition 1: Non i.i.d. generalization error bound
Proposition 2: Bound for the sequential covering number
...and 34 more

Active Learning For Contextual Linear Optimization: A Margin-Based Approach

TL;DR

Abstract

Active Learning For Contextual Linear Optimization: A Margin-Based Approach

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (9)

Theorems & Definitions (44)