Generative Myopia: Why Diffusion Models Fail at Structure
Milad Siami
TL;DR
This work identifies Generative Myopia as the failure of likelihood-based graph diffusion to preserve rare but spectrally critical edges with high $R_eff$ under ELBO optimization. It proves a Failure Theorem showing gradient starvation prevents learning rare bridges and introduces Spectrally-Weighted Diffusion with a spectral prior that reweights the ELBO via a simple objective $L_RW$ to emphasize high-$R_eff$ edges, while keeping inference cost unchanged. The approach is validated through four controlled experiments (Barbell, Asymmetric Chain, Visible Bridge, and Optimization Dynamics) demonstrating that standard diffusion can miss critical connections, whereas the weighted method achieves near-perfect connectivity and matches a spectral oracle, including 100% connectivity on adversarial benchmarks. The results offer a principled path to fuse spectral graph theory with generative diffusion, enabling robust structure-aware graph synthesis in applications requiring global connectivity and resilience.
Abstract
Graph Diffusion Models (GDMs) optimize for statistical likelihood, implicitly acting as \textbf{frequency filters} that favor abundant substructures over spectrally critical ones. We term this phenomenon \textbf{Generative Myopia}. In combinatorial tasks like graph sparsification, this leads to the catastrophic removal of ``rare bridges,'' edges that are structurally mandatory ($R_{\text{eff}} \approx 1$) but statistically scarce. We prove theoretically and empirically that this failure is driven by \textbf{Gradient Starvation}: the optimization landscape itself suppresses rare structural signals, rendering them unlearnable regardless of model capacity. To resolve this, we introduce \textbf{Spectrally-Weighted Diffusion}, which re-aligns the variational objective using Effective Resistance. We demonstrate that spectral priors can be amortized into the training phase with zero inference overhead. Our method eliminates myopia, matching the performance of an optimal Spectral Oracle and achieving \textbf{100\% connectivity} on adversarial benchmarks where standard diffusion fails completely (0\%).
