Are Images Indistinguishable to Humans Also Indistinguishable to Classifiers?
Zebin You, Xinyu Zhang, Hanzhong Guo, Jingdong Wang, Chongxuan Li
TL;DR
This work introduces distribution classification as a diagnostic tool to quantify how far diffusion-generated images are from real data, revealing that neural-network classifiers can reliably distinguish real from generated distributions even when FID and human judgments suggest high realism. It shows contradictions between classifier-based distribution distance, FID, and human perception, and demonstrates practical implications including when to augment versus replace real data and how classifier guidance can improve generated image realism. The study provides granular insights into spatial and frequency features learned by diffusion models and highlights MAD (Model Autophagy Disorder) as a consequence of distribution drift in continual synthetic-data loops. Overall, distribution classification offers a complementary perspective to traditional metrics, enabling deeper understanding and practical improvements in generative modeling and data usage.
Abstract
The ultimate goal of generative models is to perfectly capture the data distribution. For image generation, common metrics of visual quality (e.g., FID) and the perceived truthfulness of generated images seem to suggest that we are nearing this goal. However, through distribution classification tasks, we reveal that, from the perspective of neural network-based classifiers, even advanced diffusion models are still far from this goal. Specifically, classifiers are able to consistently and effortlessly distinguish real images from generated ones across various settings. Moreover, we uncover an intriguing discrepancy: classifiers can easily differentiate between diffusion models with comparable performance (e.g., U-ViT-H vs. DiT-XL), but struggle to distinguish between models within the same family but of different scales (e.g., EDM2-XS vs. EDM2-XXL). Our methodology carries several important implications. First, it naturally serves as a diagnostic tool for diffusion models by analyzing specific features of generated data. Second, it sheds light on the model autophagy disorder and offers insights into the use of generated data: augmenting real data with generated data is more effective than replacing it. Third, classifier guidance can significantly enhance the realism of generated images.
