Scalable Decision Focused Learning via Online Trainable Surrogates
Gaetano Signorelli, Michele Lombardi
TL;DR
This work tackles the scalability bottlenecks of Decision Focused Learning by introducing a Gaussian Process-based surrogate that replaces costly regret evaluations during training. The surrogate is trained online with stochastic smoothing and an optional sample-sharing mechanism, and it can switch to a fall-back method (SFGE) when confidence is low, preserving asymptotically unbiased gradient information. Empirical results across knapsack, set cover, and toy benchmarks show substantial reductions in inner solver calls and runtime without sacrificing solution quality, outperforming several black-box surrogate methods. The approach enhances applicability of DFL to complex, real-world problems with recourse and nonlinear costs, while maintaining scalability and robustness in high dimensions.
Abstract
Decision support systems often rely on solving complex optimization problems that may require to estimate uncertain parameters beforehand. Recent studies have shown how using traditionally trained estimators for this task can lead to suboptimal solutions. Using the actual decision cost as a loss function (called Decision Focused Learning) can address this issue, but with a severe loss of scalability at training time. To address this issue, we propose an acceleration method based on replacing costly loss function evaluations with an efficient surrogate. Unlike previously defined surrogates, our approach relies on unbiased estimators reducing the risk of spurious local optima and can provide information on its local confidence allowing one to switch to a fallback method when needed. Furthermore, the surrogate is designed for a black-box setting, which enables compensating for simplifications in the optimization model and accounting for recourse actions during cost computation. In our results, the method reduces costly inner solver calls, with a solution quality comparable to other state-of-the-art techniques.
