On diffusion-based generative models and their error bounds: The log-concave case with full convergence estimates

Stefano Bruno; Ying Zhang; Dong-Young Lim; Ömer Deniz Akyildiz; Sotirios Sabanis

On diffusion-based generative models and their error bounds: The log-concave case with full convergence estimates

Stefano Bruno, Ying Zhang, Dong-Young Lim, Ömer Deniz Akyildiz, Sotirios Sabanis

TL;DR

This paper provides non-asymptotic convergence guarantees for diffusion-based score-based generative models under strongly log-concave data. It introduces an auxiliary backward process that depends only on known information and analyzes score estimation via a Lipschitz, time-dependent approximator, linking optimization and sampling to yield explicit Wasserstein-2 bounds. In a motivating Gaussian-with-unknown-mean example, the authors obtain an optimal rate of order one with dimension dependence $\sqrt{d}$ and explicit constants, using SGLD for score optimization. For the general case, they derive bounds of the form $W_2\le C_1\sqrt{\varepsilon}+C_2 e^{-2\widehat{L}_{\text{MO}}(T-\varepsilon)-\varepsilon}+C_3(T,\varepsilon)\sqrt{\varepsilon_{\text{SN}}}+C_4(T,\varepsilon)\gamma^{\alpha}$, showing that appropriate choices of $(\varepsilon, T, \varepsilon_{\text{SN}}, \gamma)$ yield arbitrarily small error and highlighting improved dimension dependence under relaxed smoothness assumptions. The work thereby provides state-of-the-art, explicit convergence guarantees that quantify how diffusion sampling error scales with dimension, time discretization, and score-approximation quality, with practical implications for algorithm design and optimization in SGMs.

Abstract

We provide full theoretical guarantees for the convergence behaviour of diffusion-based generative models under the assumption of strongly log-concave data distributions while our approximating class of functions used for score estimation is made of Lipschitz continuous functions avoiding any Lipschitzness assumption on the score function. We demonstrate via a motivating example, sampling from a Gaussian distribution with unknown mean, the powerfulness of our approach. In this case, explicit estimates are provided for the associated optimization problem, i.e. score approximation, while these are combined with the corresponding sampling estimates. As a result, we obtain the best known upper bound estimates in terms of key quantities of interest, such as the dimension and rates of convergence, for the Wasserstein-2 distance between the data distribution (Gaussian with unknown mean) and our sampling algorithm. Beyond the motivating example and in order to allow for the use of a diverse range of stochastic optimizers, we present our results using an $L^2$-accurate score estimation assumption, which crucially is formed under an expectation with respect to the stochastic optimizer and our novel auxiliary process that uses only known information. This approach yields the best known convergence rate for our sampling algorithm.

On diffusion-based generative models and their error bounds: The log-concave case with full convergence estimates

TL;DR

and explicit constants, using SGLD for score optimization. For the general case, they derive bounds of the form

, showing that appropriate choices of

yield arbitrarily small error and highlighting improved dimension dependence under relaxed smoothness assumptions. The work thereby provides state-of-the-art, explicit convergence guarantees that quantify how diffusion sampling error scales with dimension, time discretization, and score-approximation quality, with practical implications for algorithm design and optimization in SGMs.

Abstract

-accurate score estimation assumption, which crucially is formed under an expectation with respect to the stochastic optimizer and our novel auxiliary process that uses only known information. This approach yields the best known convergence rate for our sampling algorithm.

Paper Structure (30 sections, 13 theorems, 149 equations, 1 figure, 4 tables)

This paper contains 30 sections, 13 theorems, 149 equations, 1 figure, 4 tables.

Introduction
Technical Background
Main Results
A Motivating Example: Full Estimates for Multivariate Gaussian Initial Data with Unknown Mean
General Case
Assumptions for the General Case
Full Estimates for the General Case
Related Work and Comparison
Score Approximation Assumptions
Assumptions on the Data Distribution
Objective Function via Denoising Score Matching
Additional Discussions about Assumption \ref{['assumption_equivalence_global_minimiser_epsilon']}
Proofs of the Results for the Multivariate Gaussian Initial Data with Unknown Mean
Preliminary Estimates
Proof of the Main Result in the Motivating Example
...and 15 more sections

Key Result

Theorem 1

Under the setting described in this section, then, for any $T>0$ and $\gamma \in (0,1/2]$, where $C_{\mathsf{SGLD},1}$ and $C_{\mathsf{SGLD},2}$ are given explicitly in Table tab:convconst. In addition, the result in statement_theorem_example_case_inequality implies that for any $\delta>0$, if we choose $T>T_{\delta}$, $\beta \geq \beta_{\delta}$, $0< \lambda\leq \lambda_{\delta}$, $n\ge where $

Figures (1)

Figure 1: The quality of generated samples with respect to (a) the error with Assumption \ref{['assumption_equivalence_global_minimiser_epsilon']} and (b) the error obtained through denoising score matching using $U(\theta)$ in \ref{['objective_with_denoising_score_matching_general']}.

Theorems & Definitions (35)

Theorem 1
Remark 2
Remark 3
Remark 4
Remark 5
Remark 6
Remark 7
Remark 8
Remark 9
Theorem 10
...and 25 more

On diffusion-based generative models and their error bounds: The log-concave case with full convergence estimates

TL;DR

Abstract

On diffusion-based generative models and their error bounds: The log-concave case with full convergence estimates

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (1)

Theorems & Definitions (35)