A Bias-Free Training Paradigm for More General AI-generated Image Detection

Fabrizio Guillaro; Giada Zingarini; Ben Usman; Avneesh Sud; Davide Cozzolino; Luisa Verdoliva

A Bias-Free Training Paradigm for More General AI-generated Image Detection

Fabrizio Guillaro, Giada Zingarini, Ben Usman, Avneesh Sud, Davide Cozzolino, Luisa Verdoliva

TL;DR

The paper addresses the generalization gap in AI-generated image detection caused by dataset biases. It introduces B-Free, a bias-free training paradigm that generates semantically aligned fake images using self-conditioned reconstructions and content augmentations via Stable Diffusion 2.1, and trains a ViT-based detector end-to-end on large, non-resized crops. By assembling a bias-controlled dataset (51k real, 309k fake) and evaluating across 27 generators with metrics including AUC and calibration (ECE, NLL), the authors demonstrate improved generalization to unseen generators and better calibration. The key finding is that careful dataset design and content-aligned augmentation can outperform more complex algorithms, highlighting the importance of reducing biases to achieve robust forensic detection in real-world settings.

Abstract

Successful forensic detectors can produce excellent results in supervised learning benchmarks but struggle to transfer to real-world applications. We believe this limitation is largely due to inadequate training data quality. While most research focuses on developing new algorithms, less attention is given to training data selection, despite evidence that performance can be strongly impacted by spurious correlations such as content, format, or resolution. A well-designed forensic detector should detect generator specific artifacts rather than reflect data biases. To this end, we propose B-Free, a bias-free training paradigm, where fake images are generated from real ones using the conditioning procedure of stable diffusion models. This ensures semantic alignment between real and fake images, allowing any differences to stem solely from the subtle artifacts introduced by AI generation. Through content-based augmentation, we show significant improvements in both generalization and robustness over state-of-the-art detectors and more calibrated results across 27 different generative models, including recent releases, like FLUX and Stable Diffusion 3.5. Our findings emphasize the importance of a careful dataset design, highlighting the need for further research on this topic. Code and data are publicly available at https://grip-unina.github.io/B-Free/.

A Bias-Free Training Paradigm for More General AI-generated Image Detection

TL;DR

Abstract

A Bias-Free Training Paradigm for More General AI-generated Image Detection

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (11)