Robust Losses for Decision-Focused Learning
Noah Schutte, Krzysztof Postek, Neil Yorke-Smith
TL;DR
This work analyzes decision-focused learning (DFL) for optimization under uncertainty and identifies critical weaknesses of using empirical regret as a surrogate, especially under epistemic and aleatoric uncertainty. It introduces three robust losses—Robust Optimization (RO) Loss, Top-k Loss, and k-Nearest Neighbour (k-NN) Loss—that aim to better approximate the expected regret by stabilizing the estimator of the conditional mean costs and/or considering near-optimal decisions. The authors characterize applicability, gradient computations, and computational implications, showing that these losses can be integrated with SPO+ and PFYL without increasing per-epoch cost and with modest precomputation overhead. Empirical evaluation on shortest path, traveling salesperson, and energy-cost aware scheduling demonstrates that robust losses reduce test-sample regret more reliably than empirical regret, with k-NN often providing the strongest gains, particularly in noisy or data-scarce settings, indicating practical benefits for real-world decision-making under uncertainty.
Abstract
Optimization models used to make discrete decisions often contain uncertain parameters that are context-dependent and estimated through prediction. To account for the quality of the decision made based on the prediction, decision-focused learning (end-to-end predict-then-optimize) aims at training the predictive model to minimize regret, i.e., the loss incurred by making a suboptimal decision. Despite the challenge of the gradient of this loss w.r.t. the predictive model parameters being zero almost everywhere for optimization problems with a linear objective, effective gradient-based learning approaches have been proposed to minimize the expected loss, using the empirical loss as a surrogate. However, empirical regret can be an ineffective surrogate because empirical optimal decisions can vary substantially from expected optimal decisions. To understand the impact of this deficiency, we evaluate the effect of aleatoric and epistemic uncertainty on the accuracy of empirical regret as a surrogate. Next, we propose three novel loss functions that approximate expected regret more robustly. Experimental results show that training two state-of-the-art decision-focused learning approaches using robust regret losses improves test-sample empirical regret in general while keeping computational time equivalent relative to the number of training epochs.
