A Survey of Contextual Optimization Methods for Decision Making under Uncertainty
Utsav Sadana, Abhilash Chenreddy, Erick Delage, Alexandre Forel, Emma Frejinger, Thibaut Vidal
TL;DR
This survey consolidates the rapidly growing field of contextual optimization, where covariates are leveraged to make decisions under uncertainty. It codifies three learning paradigms—decision rule optimization, sequential learning and optimization, and integrated learning and optimization—and systematically reviews linear, kernel-based, non-linear, and distributionally robust policy classes, as well as methods for learning conditional distributions and end-to-end training via SPO and related surrogates. The authors synthesize training approaches (unrolling, implicit differentiation, surrogate losses, and differentiable optimizers) and map them to a wide range of applications, while outlining theoretical guarantees, toolboxes, and benchmarking needs. They also chart active research directions, from uncertainty in constraints and risk aversion to privacy, interpretability, and multi-agent settings, highlighting the practical impact and remaining theoretical gaps in contextual optimization. Overall, the paper provides a unifying framework, a comprehensive taxonomy, and a roadmap for advancing the integration of machine learning with stochastic programming in decision-making under uncertainty.
Abstract
Recently there has been a surge of interest in operations research (OR) and the machine learning (ML) community in combining prediction algorithms and optimization techniques to solve decision-making problems in the face of uncertainty. This gave rise to the field of contextual optimization, under which data-driven procedures are developed to prescribe actions to the decision-maker that make the best use of the most recently updated information. A large variety of models and methods have been presented in both OR and ML literature under a variety of names, including data-driven optimization, prescriptive optimization, predictive stochastic programming, policy optimization, (smart) predict/estimate-then-optimize, decision-focused learning, (task-based) end-to-end learning/forecasting/optimization, etc. Focusing on single and two-stage stochastic programming problems, this review article identifies three main frameworks for learning policies from data and discusses their strengths and limitations. We present the existing models and methods under a uniform notation and terminology and classify them according to the three main frameworks identified. Our objective with this survey is to both strengthen the general understanding of this active field of research and stimulate further theoretical and algorithmic advancements in integrating ML and stochastic programming.
