Table of Contents
Fetching ...

From Sequential to Recursive: Enhancing Decision-Focused Learning with Bidirectional Feedback

Xinyu Wang, Jinxiao Du, Yiyang Peng, Wei Ma

TL;DR

This paper addresses the limitations of sequential decision-focused learning (S-DFL) by introducing Recursive DFL (R-DFL), which captures bidirectional feedback between prediction and optimization in closed-loop settings. It develops two gradient propagation approaches—explicit unrolling and implicit differentiation—and proves their gradient equivalence under suitable conditions, highlighting a trade-off between implementation simplicity and computational efficiency. Through extensive experiments on synthetic and real-world tasks (newsvendor and bipartite matching), R-DFL demonstrates substantial improvements in final decision quality over S-DFL and PTO, with implicit differentiation offering notably faster training. The work lays a foundation for truly unified prediction-prescription workflows in dynamic, interactive systems and suggests future extensions to stochastic and integer optimization scenarios.

Abstract

Decision-focused learning (DFL) has emerged as a powerful end-to-end alternative to conventional predict-then-optimize (PTO) pipelines by directly optimizing predictive models through downstream decision losses. Existing DFL frameworks are limited by their strictly sequential structure, referred to as sequential DFL (S-DFL). However, S-DFL fails to capture the bidirectional feedback between prediction and optimization in complex interaction scenarios. In view of this, we first time propose recursive decision-focused learning (R-DFL), a novel framework that introduces bidirectional feedback between downstream optimization and upstream prediction. We further extend two distinct differentiation methods: explicit unrolling via automatic differentiation and implicit differentiation based on fixed-point methods, to facilitate efficient gradient propagation in R-DFL. We rigorously prove that both methods achieve comparable gradient accuracy, with the implicit method offering superior computational efficiency. Extensive experiments on both synthetic and real-world datasets, including the newsvendor problem and the bipartite matching problem, demonstrate that R-DFL not only substantially enhances the final decision quality over sequential baselines but also exhibits robust adaptability across diverse scenarios in closed-loop decision-making problems.

From Sequential to Recursive: Enhancing Decision-Focused Learning with Bidirectional Feedback

TL;DR

This paper addresses the limitations of sequential decision-focused learning (S-DFL) by introducing Recursive DFL (R-DFL), which captures bidirectional feedback between prediction and optimization in closed-loop settings. It develops two gradient propagation approaches—explicit unrolling and implicit differentiation—and proves their gradient equivalence under suitable conditions, highlighting a trade-off between implementation simplicity and computational efficiency. Through extensive experiments on synthetic and real-world tasks (newsvendor and bipartite matching), R-DFL demonstrates substantial improvements in final decision quality over S-DFL and PTO, with implicit differentiation offering notably faster training. The work lays a foundation for truly unified prediction-prescription workflows in dynamic, interactive systems and suggests future extensions to stochastic and integer optimization scenarios.

Abstract

Decision-focused learning (DFL) has emerged as a powerful end-to-end alternative to conventional predict-then-optimize (PTO) pipelines by directly optimizing predictive models through downstream decision losses. Existing DFL frameworks are limited by their strictly sequential structure, referred to as sequential DFL (S-DFL). However, S-DFL fails to capture the bidirectional feedback between prediction and optimization in complex interaction scenarios. In view of this, we first time propose recursive decision-focused learning (R-DFL), a novel framework that introduces bidirectional feedback between downstream optimization and upstream prediction. We further extend two distinct differentiation methods: explicit unrolling via automatic differentiation and implicit differentiation based on fixed-point methods, to facilitate efficient gradient propagation in R-DFL. We rigorously prove that both methods achieve comparable gradient accuracy, with the implicit method offering superior computational efficiency. Extensive experiments on both synthetic and real-world datasets, including the newsvendor problem and the bipartite matching problem, demonstrate that R-DFL not only substantially enhances the final decision quality over sequential baselines but also exhibits robust adaptability across diverse scenarios in closed-loop decision-making problems.

Paper Structure

This paper contains 52 sections, 8 theorems, 74 equations, 5 figures, 3 tables, 2 algorithms.

Key Result

Theorem 1

Define the Jacobian of function $\Phi_\theta$ at $\boldsymbol{x}_{i}$ to be: The Jacobian $J_{\Phi_\theta}|_{\boldsymbol{x}_{i}}$ is computed as: Then the loss gradient $\mathcal{L}$ w.r.t. $\theta$ at the final $K^{th}$ layer is:

Figures (5)

  • Figure 1: Comparison of S-DFL and R-DFL. R-DFL extends S-DFL by capturing the bidirectional interaction between prediction and optimization, where predictions are informed by both input features and optimization feedback.
  • Figure 2: Illustration of the R-DFL framework with explicit unrolling and implicit differentiation methods.
  • Figure 3: Accuracy comparison between R-DFL-U and R-DFL-I on newsvendor dataset across three scales.
  • Figure 4: Sensitivity analysis of unrolling layers.
  • Figure 5: Training curves of R-DFL-U and R-DFL-I.

Theorems & Definitions (12)

  • Theorem 1: Gradient of Explicit Unrolling Methods
  • Theorem 2: Gradient of Implicit Differentiation Methods
  • Lemma 1: Convergence of Explicit Unrolling Methods
  • Theorem 3: Gradient Equivalence of Explicit Unrolling and Implicit Differentiation Methods
  • Theorem 1: Gradient of Explicit Unrolling Methods
  • proof
  • Theorem 2: Gradient of Implicit Differentiation Methods
  • proof
  • Lemma 2: Convergence of the Explicit Unrolling Methods
  • proof
  • ...and 2 more