Spectral Analysis of Diffusion Models with Application to Schedule Design
Roi Benita, Michael Elad, Joseph Keshet
TL;DR
This work provides a theoretical spectral lens on diffusion-model inference by assuming Gaussian target distributions, deriving a closed-form spectral transfer function that maps input noise to output signals. Using this framework, the authors formulate and solve a data-aware noise-schedule optimization under Wasserstein-2 or KL divergence, applicable to DDIM and DDPM under VP/VE. They demonstrate that optimal spectral schedules align with dataset spectral content and often resemble known heuristics, while offering improved performance with few diffusion steps on synthetic and real datasets (e.g., CIFAR-10, MUSIC, SC09). The approach clarifies the link between the data spectrum and diffusion dynamics, enabling principled schedule design and potential speedups in synthesis without retraining denoisers. Overall, the paper provides a principled, frequency-domain method to tailor diffusion schedules to data characteristics, with practical benefits for sample quality and efficiency.
Abstract
Diffusion models (DMs) have emerged as powerful tools for modeling complex data distributions and generating realistic new samples. Over the years, advanced architectures and sampling methods have been developed to make these models practically usable. However, certain synthesis process decisions still rely on heuristics without a solid theoretical foundation. In our work, we offer a novel analysis of the DM's inference process, introducing a comprehensive frequency response perspective. Specifically, by relying on Gaussianity assumption, we present the inference process as a closed-form spectral transfer function, capturing how the generated signal evolves in response to the initial noise. We demonstrate how the proposed analysis can be leveraged to design a noise schedule that aligns effectively with the characteristics of the data. The spectral perspective also provides insights into the underlying dynamics and sheds light on the relationship between spectral properties and noise schedule structure. Our results lead to scheduling curves that are dependent on the spectral content of the data, offering a theoretical justification for some of the heuristics taken by practitioners.
