Table of Contents
Fetching ...

Exploring the Boundary of Diffusion-based Methods for Solving Constrained Optimization

Shutong Ding, Yimiao Zhou, Ke Hu, Xi Yao, Junchi Yan, Xiaoying Tang, Ye Shi

TL;DR

This work identifies a fundamental challenge in applying diffusion models to continuous constrained optimization: purely supervised diffusion tends to generate near-optimal but infeasible points in high dimensions. It introduces DiOpt, a two-phase diffusion framework that first uses a supervised warm-start and then engages a bootstrapped self-training regime guided by a feasibility-focused target distribution, along with a historical-solution replay mechanism to stabilize learning. The authors provide a theoretical justification for the infeasibility of purely supervised diffusion in high dimensions and show through extensive experiments (QP variants, ACOPF, and motion retargeting) that DiOpt achieves a favorable balance between feasibility and near-optimality, outperforming baselines. The approach has practical implications for real-world constrained optimization tasks in power systems, robotics, and related domains, where hard constraints and safety are critical.

Abstract

Diffusion models have achieved remarkable success in generative tasks such as image and video synthesis, and in control domains like robotics, owing to their strong generalization capabilities and proficiency in fitting complex multimodal distributions. However, their full potential in solving Continuous Constrained Optimization problems remains largely underexplored. Our work commences by investigating a two-dimensional constrained quadratic optimization problem as an illustrative example to explore the inherent challenges and issues when applying diffusion models to such optimization tasks and providing theoretical analyses for these observations. To address the identified gaps and harness diffusion models for Continuous Constrained Optimization, we build upon this analysis to propose a novel diffusion-based framework for optimization problems called DiOpt. This framework operates in two distinct phases: an initial warm-start phase, implemented via supervised learning, followed by a bootstrapping phase. This dual-phase architecture is designed to iteratively refine solutions, thereby improving the objective function while rigorously satisfying problem constraints. Finally, multiple candidate solutions are sampled, and the optimal one is selected through a screening process. We present extensive experiments detailing the training dynamics of DiOpt, its performance across a diverse set of Continuous Constrained Optimization problems, and an analysis of the impact of DiOpt's various hyperparameters.

Exploring the Boundary of Diffusion-based Methods for Solving Constrained Optimization

TL;DR

This work identifies a fundamental challenge in applying diffusion models to continuous constrained optimization: purely supervised diffusion tends to generate near-optimal but infeasible points in high dimensions. It introduces DiOpt, a two-phase diffusion framework that first uses a supervised warm-start and then engages a bootstrapped self-training regime guided by a feasibility-focused target distribution, along with a historical-solution replay mechanism to stabilize learning. The authors provide a theoretical justification for the infeasibility of purely supervised diffusion in high dimensions and show through extensive experiments (QP variants, ACOPF, and motion retargeting) that DiOpt achieves a favorable balance between feasibility and near-optimality, outperforming baselines. The approach has practical implications for real-world constrained optimization tasks in power systems, robotics, and related domains, where hard constraints and safety are critical.

Abstract

Diffusion models have achieved remarkable success in generative tasks such as image and video synthesis, and in control domains like robotics, owing to their strong generalization capabilities and proficiency in fitting complex multimodal distributions. However, their full potential in solving Continuous Constrained Optimization problems remains largely underexplored. Our work commences by investigating a two-dimensional constrained quadratic optimization problem as an illustrative example to explore the inherent challenges and issues when applying diffusion models to such optimization tasks and providing theoretical analyses for these observations. To address the identified gaps and harness diffusion models for Continuous Constrained Optimization, we build upon this analysis to propose a novel diffusion-based framework for optimization problems called DiOpt. This framework operates in two distinct phases: an initial warm-start phase, implemented via supervised learning, followed by a bootstrapping phase. This dual-phase architecture is designed to iteratively refine solutions, thereby improving the objective function while rigorously satisfying problem constraints. Finally, multiple candidate solutions are sampled, and the optimal one is selected through a screening process. We present extensive experiments detailing the training dynamics of DiOpt, its performance across a diverse set of Continuous Constrained Optimization problems, and an analysis of the impact of DiOpt's various hyperparameters.

Paper Structure

This paper contains 29 sections, 1 theorem, 34 equations, 34 figures, 3 tables, 3 algorithms.

Key Result

Theorem 1

(Feasibility in Linear Programming.) Given a d-dimensional linear programming problem of the form: where $a_i$ represents a unit normal vector, drawn independently and uniformly from the unit sphere $S^{d-1}$. Let $x^{\star}$ be the unique solution to this linear programming problem, and we define a neighborhood ball $B_{\epsilon}(x^\star)=\left\{x:\left\|x-x^\star\right\|\le \epsilon\right\}$. F

Figures (34)

  • Figure 1: A schematic geometric interpretation of feasibility challenges in learning-based optimization. The feasible region (blue) and neural network's output distribution (red), e.g., a Gaussian output by a diffusion model, fundamentally exhibit a small overlap, particularly under high dimensionality and multiple constraints. This distributional misalignment breaks constraint satisfaction.
  • Figure 2: Diffusion model trained in a supervised manner.
  • Figure 3: Diffusion solver trained by DiOpt.
  • Figure 5: Training and evaluating procedure of DiOpt.
  • Figure 6: Training progression across different optimization tasks, with each row representing a unique optimization problem (indicated in subtitles) and each column showing specific performance metrics. The horizontal axes track training epochs, while the vertical axes display corresponding metric values with detailed scales labeled for each plot. Notably, DiOpt achieves sudden gains in constraint satisfaction approximately at 2000 epochs when compared with Diffusion. The bootstrapping mechanism initially causes an upward perturbation in the objective function and optimality gap, which then progressively decays to near-optimal.
  • ...and 29 more figures

Theorems & Definitions (2)

  • Theorem 1
  • proof