SFLD: Reducing the content bias for AI-generated Image Detection
Seoyeon Gye, Junwon Ko, Hyounguk Shon, Minchan Kwon, Junmo Kim
TL;DR
This paper tackles AI-generated image detection, targeting two weaknesses of existing detectors: poor generalization to unseen generators and reliance on low-level fingerprints that are easily destroyed by image degradation. It introduces SFLD, a patch-shuffling detector built on a CLIP-ViT-L/14 backbone that ensembles classifiers trained on multiple patch sizes to fuse high-level semantic cues with low-level texture artifacts. It also introduces TwinSynths, a benchmark of visually near-identical real–fake image pairs for evaluating content preservation and generalization. Empirical results show that SFLD achieves state-of-the-art AP on conventional benchmarks, maintains high performance on unseen generators, and is robust to Gaussian blur and JPEG degradation, with strong results on low-level vision/perceptual benchmarks. The work provides a practical detector with broad generalization and a benchmarking methodology for more realistic evaluation.
Abstract
Identifying AI-generated content is critical for the safe and ethical use of generative AI. Recent research has focused on developing detectors that generalize to unknown generators, with popular methods relying on either high-level features or low-level fingerprints. However, these methods have clear limitations: they are either biased towards unseen content or vulnerable to common image degradations such as JPEG compression. To address these issues, we propose a novel approach, SFLD, which incorporates PatchShuffle to integrate high-level semantic and low-level textural information. SFLD applies PatchShuffle at multiple levels, improving robustness and generalization across various generative models. Additionally, current benchmarks suffer from low image quality, insufficient content preservation, and limited class diversity. In response, we introduce TwinSynths, a new benchmark generation methodology that constructs visually near-identical pairs of real and synthetic images to ensure high quality and content preservation. Our extensive experiments and analysis show that SFLD outperforms existing methods in detecting a wide variety of fake images sourced from GANs, diffusion models, and TwinSynths, demonstrating state-of-the-art performance and strong generalization to novel generative models.
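To make the PatchShuffle idea concrete, the following is a minimal NumPy sketch, not the authors' implementation. It assumes an H x W x C image whose sides are divisible by the patch size, and it assumes that the per-patch-size classifiers are combined by simple score averaging; the names `patch_shuffle`, `ensemble_score`, and `scorers`, as well as the patch sizes in the usage note, are hypothetical and chosen only for illustration.

```python
import numpy as np


def patch_shuffle(image, patch_size, rng=None):
    """Randomly permute the non-overlapping square patches of an H x W x C image.

    Shuffling destroys the global layout (high-level content) while leaving
    the pixels inside each patch, and hence local texture, untouched.
    """
    rng = np.random.default_rng() if rng is None else rng
    h, w, c = image.shape
    if h % patch_size or w % patch_size:
        raise ValueError("image height and width must be divisible by patch_size")
    rows, cols = h // patch_size, w // patch_size

    # Cut the image into a flat list of patches: (rows * cols, p, p, C).
    patches = (
        image.reshape(rows, patch_size, cols, patch_size, c)
        .transpose(0, 2, 1, 3, 4)
        .reshape(rows * cols, patch_size, patch_size, c)
    )

    # Reorder the patches with a random permutation.
    patches = patches[rng.permutation(rows * cols)]

    # Reassemble the shuffled patches into an H x W x C image.
    return (
        patches.reshape(rows, cols, patch_size, patch_size, c)
        .transpose(0, 2, 1, 3, 4)
        .reshape(h, w, c)
    )


def ensemble_score(image, scorers, rng=None):
    """Average the scores of one classifier per patch size.

    `scorers` maps a patch size to a callable that takes the shuffled image
    and returns a "fake" score. Averaging is an assumption of this sketch;
    the source only states that the classifiers are ensembled.
    """
    rng = np.random.default_rng() if rng is None else rng
    scores = [
        scorer(patch_shuffle(image, patch_size, rng))
        for patch_size, scorer in scorers.items()
    ]
    return float(np.mean(scores))
```

As a usage illustration, a 224 x 224 input could be scored with `ensemble_score(image, {28: clf_small, 56: clf_large})`, where `clf_small` and `clf_large` are placeholder classifiers. Smaller patches remove more of the scene layout and push the classifier toward texture cues, while a patch size equal to the image side leaves the image unchanged and preserves all semantic content.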
