Diffusion-DFL: Decision-focused Diffusion Models for Stochastic Optimization
Zihao Zhao, Christopher Yeh, Lingkai Kong, Kai Wang
TL;DR
This work addresses decision-focused learning under parameter uncertainty by introducing diffusion-based predictors to model complex, multi-modal distributions of uncertain parameters. It develops two end-to-end training approaches: a memory-intensive reparameterization method and a lightweight score-function estimator with a weighted ELBO gradient and variance-reduction, enabling scalable diffusion DFL. Across synthetic, power-scheduling, and portfolio tasks, diffusion DFL consistently outperforms two-stage and deterministic baselines, with the score-function variant achieving comparable decision quality while dramatically reducing memory usage (from around 60.75 GB to 0.13 GB). The proposed framework advances practical decision-making under uncertainty and provides open-source code to facilitate reproducibility and further research.
Abstract
Decision-focused learning (DFL) integrates predictive modeling and optimization by training predictors to optimize the downstream decision target rather than merely minimizing prediction error. To date, existing DFL methods typically rely on deterministic point predictions, which are often insufficient to capture the intrinsic stochasticity of real-world environments. To address this challenge, we propose the first diffusion-based DFL approach, which trains a diffusion model to represent the distribution of uncertain parameters and optimizes the decision by solving a stochastic optimization with samples drawn from the diffusion model. Our contributions are twofold. First, we formulate diffusion DFL using the reparameterization trick, enabling end-to-end training through diffusion. While effective, it is memory and compute-intensive due to the need to differentiate through the diffusion sampling process. Second, we propose a lightweight score function estimator that uses only several forward diffusion passes and avoids backpropagation through the sampling. This follows from our results that backpropagating through stochastic optimization can be approximated by a weighted score function formulation. We empirically show that our diffusion DFL approach consistently outperforms strong baselines in decision quality. The source code for all experiments is available at the project repository: https://github.com/GT-KOALA/Diffusion_DFL.
