Advancing Wasserstein Convergence Analysis of Score-Based Models: Insights from Discretization and Second-Order Acceleration

Yifeng Yu; Lu Yu

TL;DR

This work analyzes Wasserstein-2 ($W_2$) convergence of score-based diffusion models under several discretization schemes (EM, EI, REM, and REI), showing how each scheme affects the convergence rate and where discretization and score-estimation errors enter the bounds. It then introduces a second-order accelerated sampler based on local linearization, which uses Hessian information about the log-density to reach $W_2$ accuracy $\varepsilon$ with a near-optimal iteration complexity of $\widetilde{\mathcal{O}}(1/\varepsilon)$, compared with $\widetilde{\mathcal{O}}(1/\varepsilon^2)$ for vanilla diffusion models. The theory is complemented by numerical studies on penalized logistic regression posteriors, in which the Hessian-informed method consistently outperforms the first-order schemes. Together, the results broaden the understanding of Wasserstein convergence in score-based generative models (SGMs) and offer practical guidance on choosing a discretization scheme and on using second-order information to accelerate diffusion-based samplers. The work also sets the stage for relaxing strong log-concavity and for extending the analysis to more general forward processes and deterministic ODE-based samplers.

Abstract

Score-based diffusion models have emerged as powerful tools in generative modeling, yet their theoretical foundations remain underexplored. In this work, we focus on the Wasserstein convergence analysis of score-based diffusion models. Specifically, we investigate the impact of various discretization schemes, including Euler discretization, exponential integrators, and midpoint randomization methods. Our analysis provides a quantitative comparison of these discrete approximations, emphasizing their influence on convergence behavior. Furthermore, we explore scenarios where Hessian information is available and propose an accelerated sampler based on the local linearization method. We demonstrate that this Hessian-based approach achieves faster convergence rates of order $\widetilde{\mathcal{O}}\left(\frac{1}{\varepsilon}\right)$, significantly improving upon the standard rate $\widetilde{\mathcal{O}}\left(\frac{1}{\varepsilon^2}\right)$ of vanilla diffusion models, where $\varepsilon$ denotes the target accuracy.
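To make the difference between two of the discretization schemes named above concrete, the following is a minimal sketch, not the paper's implementation. It assumes a one-dimensional Gaussian target $N(\mu, s^2)$, an Ornstein-Uhlenbeck forward process $dX_t = -X_t\,dt + \sqrt{2}\,dW_t$, and an exact (closed-form) score, so only discretization error is present. The function names `score` and `sample` and all parameter values are hypothetical choices for illustration. The Euler step discretizes the whole reverse drift, while the exponential-integrator step freezes only the score over each step and solves the remaining linear SDE exactly.

```python
import numpy as np


def score(x, t, mu, s):
    """Exact score of p_t when p_0 = N(mu, s^2) and the forward process is
    dX = -X dt + sqrt(2) dW (an assumed setup, chosen for a closed form)."""
    mean_t = mu * np.exp(-t)
    var_t = s**2 * np.exp(-2.0 * t) + 1.0 - np.exp(-2.0 * t)
    return -(x - mean_t) / var_t


def sample(scheme, mu=2.0, s=0.5, T=5.0, n_steps=200, n_samples=20000, seed=0):
    """Simulate the reverse SDE dY = (Y + 2 * score) dt + sqrt(2) dB
    from Y_0 ~ N(0, 1) with the chosen discretization scheme."""
    rng = np.random.default_rng(seed)
    h = T / n_steps
    y = rng.standard_normal(n_samples)  # approximate draw from p_T
    for k in range(n_steps):
        t = T - k * h                   # forward time at the start of the step
        g = score(y, t, mu, s)
        xi = rng.standard_normal(n_samples)
        if scheme == "em":
            # Euler-type step: first-order discretization of the full drift.
            y = y + h * (y + 2.0 * g) + np.sqrt(2.0 * h) * xi
        elif scheme == "ei":
            # Exponential integrator: score frozen at the step start,
            # linear part of the SDE integrated exactly over [0, h].
            y = (np.exp(h) * y + 2.0 * np.expm1(h) * g
                 + np.sqrt(np.expm1(2.0 * h)) * xi)
        else:
            raise ValueError(f"unknown scheme: {scheme}")
    return y


if __name__ == "__main__":
    for scheme in ("em", "ei"):
        y = sample(scheme)
        print(scheme, "mean:", y.mean(), "std:", y.std())
```

Under these assumptions both schemes should return samples whose mean and standard deviation are close to $\mu$ and $s$; shrinking `n_steps` makes the discretization bias of each scheme visible. The randomized midpoint and Hessian-based samplers discussed in the abstract are not sketched here.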
