Learning What Matters: Steering Diffusion via Spectrally Anisotropic Forward Noise
Luca Scimeca, Thomas Jiralerspong, Berton Earnshaw, Jason Hartford, Yoshua Bengio
TL;DR
Diffusion probabilistic models rely on implicit inductive biases that shape learning. The authors introduce Spectrally Anisotropic Gaussian Diffusion (SAGD), which replaces the isotropic forward noise with a frequency-diagonal covariance in the Fourier domain, enabling targeted inductive biases via power-law or band-pass noise. They prove that, with full spectral support, the learned score converges to the true data score as $t\to0$, while anisotropy reshapes the learning trajectory. Empirically, SAGD improves image synthesis quality across diverse datasets, including large-scale ImageNet-1k in latent space, and enables selective omission of corrupted information by zeroing specific frequency bands. Overall, SAGD offers a simple, principled, drop-in mechanism to tailor inductive biases in diffusion-based generation.
Abstract
Diffusion Probabilistic Models (DPMs) have achieved strong generative performance, yet their inductive biases remain largely implicit. In this work, we aim to build inductive biases into the training and sampling of diffusion models to better accommodate the target distribution of the data to model. We introduce an anisotropic noise operator that shapes these biases by replacing the isotropic forward covariance with a structured, frequency-diagonal covariance. This operator unifies band-pass masks and power-law weightings, allowing us to emphasize or suppress designated frequency bands, while keeping the forward process Gaussian. We refer to this as Spectrally Anisotropic Gaussian Diffusion (SAGD). In this work, we derive the score relation for anisotropic forward covariances and show that, under full support, the learned score converges to the true data score as $t\!\to\!0$, while anisotropy reshapes the probability-flow path from noise to data. Empirically, we show the induced anisotropy outperforms standard diffusion across several vision datasets, and enables selective omission: learning while ignoring known corruptions confined to specific bands. Together, these results demonstrate that carefully designed anisotropic forward noise provides a simple, yet principled, handle to tailor inductive bias in DPMs.
