Learning Solutions of Stochastic Optimization Problems with Bayesian Neural Networks
Alan A. Lahoud, Erik Schaffernicht, Johannes A. Stork
TL;DR
This work addresses decision-making under uncertainty in data-driven optimization by modeling unknown OP parameters with Bayesian Neural Networks and propagating the resulting predictive distribution through stochastic optimization. It introduces two learning paradigms: Decoupled learning, which fits the parameter distribution and then solves a stochastic OP via MC sampling, and Combined learning, which differentiates through the end-to-end solver to directly minimize the downstream cost. The framework is evaluated on Newsvendor, its quadratic variant, and a portfolio risk problem using synthetic and real data, demonstrating reduced decision regret $R$ and free-aleatoric regret $FR$ relative to strong baselines, with the Combined approach often delivering the best performance under limited data or sampling. The results provide guidance on when to apply each method and highlight the practical impact of sampling choices and data availability for robust data-driven optimization.
Abstract
Mathematical solvers use parametrized Optimization Problems (OPs) as inputs to yield optimal decisions. In many real-world settings, some of these parameters are unknown or uncertain. Recent research focuses on predicting the value of these unknown parameters using available contextual features, aiming to decrease decision regret by adopting end-to-end learning approaches. However, these approaches disregard prediction uncertainty and therefore make the mathematical solver susceptible to provide erroneous decisions in case of low-confidence predictions. We propose a novel framework that models prediction uncertainty with Bayesian Neural Networks (BNNs) and propagates this uncertainty into the mathematical solver with a Stochastic Programming technique. The differentiable nature of BNNs and differentiable mathematical solvers allow for two different learning approaches: In the Decoupled learning approach, we update the BNN weights to increase the quality of the predictions' distribution of the OP parameters, while in the Combined learning approach, we update the weights aiming to directly minimize the expected OP's cost function in a stochastic end-to-end fashion. We do an extensive evaluation using synthetic data with various noise properties and a real dataset, showing that decisions regret are generally lower (better) with both proposed methods.
