A Good Score Does not Lead to A Good Generative Model
Sixu Li, Shi Chen, Qin Li
TL;DR
This paper challenges the common belief that a well-learned score function guarantees genuinely generative samples in Score-based Generative Models (SGMs). By constructing a toy KDE-based argument and proving new non-asymptotic results, it shows that an empirical optimal score can render a DDPM that effectively operates as a Gaussian KDE, producing memorized replicas rather than novel samples. The authors establish explicit sample-complexity bounds for the empirical score and prove that, under those conditions, the resulting generator behaves like KDE, highlighting memorization as a fundamental limitation of current SGMs. Through numerical experiments on a 2D Gaussian and CIFAR-10, they illustrate the gap between distributional closeness and generation creativity, arguing for theoretical criteria that explicitly quantify generative novelty alongside imitation. The work underscores the need for new convergence notions that assess not only how close generated distributions are to the target but also how effectively SGMs can produce new, diverse samples.
Abstract
Score-based Generative Models (SGMs) is one leading method in generative modeling, renowned for their ability to generate high-quality samples from complex, high-dimensional data distributions. The method enjoys empirical success and is supported by rigorous theoretical convergence properties. In particular, it has been shown that SGMs can generate samples from a distribution that is close to the ground-truth if the underlying score function is learned well, suggesting the success of SGM as a generative model. We provide a counter-example in this paper. Through the sample complexity argument, we provide one specific setting where the score function is learned well. Yet, SGMs in this setting can only output samples that are Gaussian blurring of training data points, mimicking the effects of kernel density estimation. The finding resonates a series of recent finding that reveal that SGMs can demonstrate strong memorization effect and fail to generate.
