Contractive Diffusion Probabilistic Models

Wenpin Tang; Hanyang Zhao

Contractive Diffusion Probabilistic Models

Wenpin Tang, Hanyang Zhao

TL;DR

This work introduces Contractive Diffusion Probabilistic Models (CDPMs), a framework that enforces contractive backward sampling to curb the propagation of score-matching and discretization errors in diffusion models. It develops theoretical Wasserstein-2 bounds, shows how contraction can be achieved via design choices (contractive OU, VP, and subVP SDEs), and connects CDPMs to VE through a transformation that leverages pretrained scores without retraining. Empirically, CDPMs demonstrate robustness and improved performance across 1D, Swiss Roll, MNIST, CIFAR-10, and AFHQ data, notably achieving competitive CIFAR-10 results while requiring no retraining. The results advocate for incorporating contraction into DPM design as a principled path to more reliable and efficient generative modeling.

Abstract

Diffusion probabilistic models (DPMs) have emerged as a promising technique in generative modeling. The success of DPMs relies on two ingredients: time reversal of diffusion processes and score matching. In view of possibly unguaranteed score matching, we propose a new criterion -- the contraction property of backward sampling in the design of DPMs, leading to a novel class of contractive DPMs (CDPMs). Our key insight is that, the contraction property can provably narrow score matching errors and discretization errors, thus our proposed CDPMs are robust to both sources of error. For practical use, we show that CDPM can leverage weights of pretrained DPMs by a simple transformation, and does not need retraining. We corroborated our approach by experiments on synthetic 1-dim examples, Swiss Roll, MNIST, CIFAR-10 32$\times$32 and AFHQ 64$\times$64 dataset. Notably, CDPM steadily improves the performance of baseline score-based diffusion models.

Contractive Diffusion Probabilistic Models

TL;DR

Abstract

32 and AFHQ 64

64 dataset. Notably, CDPM steadily improves the performance of baseline score-based diffusion models.

Paper Structure (16 sections, 8 theorems, 81 equations, 5 figures, 8 tables)

This paper contains 16 sections, 8 theorems, 81 equations, 5 figures, 8 tables.

Introduction
Background on Score-based Diffusion Models
Forward and Backward SDEs.
Score Matching
Designs of $b$ and $\sigma$.
Theory for contractive DPMs
Sampling error in continuous time
Discretization error
Connections between CDPM and VE
VE is contractive at earlier denoising steps
CDPM can be transformed from VE
Experiments
CDPM shows better performance with the same scoring matching error
Swiss Roll and MNIST datasets
CIFAR-10 dataset
...and 1 more sections

Key Result

Theorem 2

Let Assumption assump:sensitivity hold, and any $h > 0$. Define $\eta:=W_2(p(T, \cdot), p_{\hbox{noise}}(\cdot))$, and Then we have

Figures (5)

Figure 1: Contraction of VE
Figure 2: Perturbation kernels
Figure 3: CIFAR-10 Synthesis (CsubVP).
Figure 4: (AFHQv2 sample) LEFT: EDM, RIGHT: EDM with contraction.
Figure 5: CsubVP CIFAR10 samples

Theorems & Definitions (13)

Theorem 2
Theorem 3
Theorem 6
Theorem 7
Theorem 8
proof
Theorem 9
proof
Lemma 10
proof
...and 3 more

Contractive Diffusion Probabilistic Models

TL;DR

Abstract

Contractive Diffusion Probabilistic Models

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (5)

Theorems & Definitions (13)