Diffusion Models and the Manifold Hypothesis: Log-Domain Smoothing is Geometry Adaptive
Tyler Farghly, Peter Potaptchik, Samuel Howard, George Deligiannidis, Jakiw Pidstrigach
TL;DR
The paper investigates diffusion models through the lens of the manifold hypothesis and argues that smoothing the score in the log-density domain acts as geometry-adaptive regularisation. It establishes that log-domain smoothing aligns with manifold structure, proving exact equivalence to manifold-adapted smoothing in the linear (affine) setting and providing Rényi-divergence bounds for curved manifolds. The theory is complemented by high-dimensional experiments in latent and pixel spaces, showing that score-smoothed diffusion preserves or distributes mass along data manifolds and can interpolate along geometries consistent with the underlying structure. A key takeaway is that the smoothing kernel induces a geometric bias, enabling controlled generalisation along chosen manifolds, with practical implications for generation quality and diversity. The work also discusses limitations and future directions, such as relaxing assumptions about kernels and curvature and exploring architectural influences on smoothing behavior.
Abstract
Diffusion models have achieved state-of-the-art performance, demonstrating remarkable generalisation capabilities across diverse domains. However, the mechanisms underpinning these strong capabilities remain only partially understood. A leading conjecture, based on the manifold hypothesis, attributes this success to their ability to adapt to low-dimensional geometric structure within the data. This work provides evidence for this conjecture, focusing on how such phenomena could result from the formulation of the learning problem through score matching. We inspect the role of implicit regularisation by investigating the effect of smoothing minimisers of the empirical score matching objective. Our theoretical and empirical results confirm that smoothing the score function -- or equivalently, smoothing in the log-density domain -- produces smoothing tangential to the data manifold. In addition, we show that the manifold along which the diffusion model generalises can be controlled by choosing an appropriate smoothing.
