Manifold Induced Biases for Zero-shot and Few-shot Detection of Generated Images
Jonathan Brokman, Amit Giloni, Omer Hofman, Roman Vainshtein, Hisashi Kojima, Guy Gilboa
TL;DR
The paper tackles the challenge of distinguishing real from AI-generated images in zero-shot and few-shot regimes. It builds a theoretical framework around biases on the learned probability manifold of pre-trained diffusion models, using score-function analysis to define curvature and gradient-based criteria that quantify generation biases. A three-term detector $C(x_0)$ combines curvature, gradient magnitude, and a bias term to enable zero-shot detection, with extensive evaluation across 20 generative models showing state-of-the-art performance and robust generalization. It also demonstrates practical gains in a mixture-of-experts few-shot setting, highlighting the approach's potential for scalable content verification without constant data maintenance.
Abstract
Distinguishing between real and AI-generated images, commonly referred to as 'image detection', presents a timely and significant challenge. Despite extensive research in the (semi-)supervised regime, zero-shot and few-shot solutions have only recently emerged as promising alternatives. Their main advantage is in alleviating the ongoing data maintenance, which quickly becomes outdated due to advances in generative technologies. We identify two main gaps: (1) a lack of theoretical grounding for the methods, and (2) significant room for performance improvements in zero-shot and few-shot regimes. Our approach is founded on understanding and quantifying the biases inherent in generated content, where we use these quantities as criteria for characterizing generated images. Specifically, we explore the biases of the implicit probability manifold, captured by a pre-trained diffusion model. Through score-function analysis, we approximate the curvature, gradient, and bias towards points on the probability manifold, establishing criteria for detection in the zero-shot regime. We further extend our contribution to the few-shot setting by employing a mixture-of-experts methodology. Empirical results across 20 generative models demonstrate that our method outperforms current approaches in both zero-shot and few-shot settings. This work advances the theoretical understanding and practical usage of generated content biases through the lens of manifold analysis.
