DPAC: Distribution-Preserving Adversarial Control for Diffusion Sampling

Han-Jin Lee; Han-Ju Lee; Jin-Seong Kim; Seok-Hwan Choi

DPAC: Distribution-Preserving Adversarial Control for Diffusion Sampling

Han-Jin Lee, Han-Ju Lee, Jin-Seong Kim, Seok-Hwan Choi

TL;DR

This work tackles the instability of gradient-guided diffusion sampling by decomposing guidance into density-preserving (tangential) and density-changing (normal) components. By projecting adversarial gradients onto the tangent space defined by the score geometry, DPAC minimizes path-space KL and yields tighter bounds on $W_2$ and $FID$, while maintaining target attainment. The authors provide a rigorous theoretical framework (Girsanov, weighted Hodge decomposition, discrete bounds) and demonstrate practical gains on ImageNet-100, achieving lower FID and reduced energy consumption with a robust denoise-then-perturb injection. The approach is shown to be second-order robust to score/metric approximations and architecture-agnostic, offering a principled path to high-fidelity, adversarial diffusion guidance. Overall, DPAC unifies attack effectiveness with perceptual fidelity by an energy-minimization principle and practical tangential projection, delivering stable, efficient UAE generation.

Abstract

Adversarially guided diffusion sampling often achieves the target class, but sample quality degrades as deviations between the adversarially controlled and nominal trajectories accumulate. We formalize this degradation as a path-space Kullback-Leibler divergence(path-KL) between controlled and nominal (uncontrolled) diffusion processes, thereby showing via Girsanov's theorem that it exactly equals the control energy. Building on this stochastic optimal control (SOC) view, we theoretically establish that minimizing this path-KL simultaneously tightens upper bounds on both the 2-Wasserstein distance and Fréchet Inception Distance (FID), revealing a principled connection between adversarial control energy and perceptual fidelity. From a variational perspective, we derive a first-order optimality condition for the control: among all directions that yield the same classification gain, the component tangent to iso-(log-)density surfaces (i.e., orthogonal to the score) minimizes path-KL, whereas the normal component directly increases distributional drift. This leads to DPAC (Distribution-Preserving Adversarial Control), a diffusion guidance rule that projects adversarial gradients onto the tangent space defined by the generative score geometry. We further show that in discrete solvers, the tangent projection cancels the O(Δt) leading error term in the Wasserstein distance, achieving an O(Δt^2) quality gap; moreover, it remains second-order robust to score or metric approximation. Empirical studies on ImageNet-100 validate the theoretical predictions, confirming that DPAC achieves lower FID and estimated path-KL at matched attack success rates.

DPAC: Distribution-Preserving Adversarial Control for Diffusion Sampling

TL;DR

Abstract

DPAC: Distribution-Preserving Adversarial Control for Diffusion Sampling

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (5)