Linear Convergence of Diffusion Models Under the Manifold Hypothesis
Peter Potaptchik, Iskander Azangulov, George Deligiannidis
TL;DR
This work analyzes score-based diffusion models under the manifold hypothesis, where data lie on a d-dimensional manifold within a high-dimensional space. By combining a martingale-based view of the backward process with a discretization tailored to the intrinsic dimension, the authors prove that the number of denoising steps needed to achieve KL convergence scales linearly with d up to logarithmic factors. They also establish the tightness of this linear dependence via a tensorization argument, showing that the bound cannot in general be improved with respect to d. The results help explain why diffusion models perform well in practice on data with low intrinsic dimensionality, despite large ambient dimensions.
Abstract
Score-matching generative models have proven successful at sampling from complex high-dimensional data distributions. In many applications, this distribution is believed to concentrate on a much lower $d$-dimensional manifold embedded into $D$-dimensional space; this is known as the manifold hypothesis. The current best-known convergence guarantees are either linear in $D$ or polynomial (superlinear) in $d$. The latter exploits a novel integration scheme for the backward SDE. We take the best of both worlds and show that the number of steps diffusion models require in order to converge in Kullback-Leibler~(KL) divergence is linear (up to logarithmic terms) in the intrinsic dimension $d$. Moreover, we show that this linear dependency is sharp.
