From Evidence to Verdict: An Agent-Based Forensic Framework for AI-Generated Image Detection

Mengfei Liang; Yiting Qu; Yukun Jiang; Michael Backes; Yang Zhang

From Evidence to Verdict: An Agent-Based Forensic Framework for AI-Generated Image Detection

Mengfei Liang, Yiting Qu, Yukun Jiang, Michael Backes, Yang Zhang

TL;DR

The paper presents AIFo, a training-free, agent-based forensic framework for AI-generated image detection. By coordinating a toolbox of external forensic tools through specialized LLM-based agents and a structured debate mechanism, AIFo achieves high accuracy and robustness across lab and in-the-wild data, outperforming traditional classifiers and single VLMs. A memory-augmented reasoning module further enhances performance by leveraging historical cases to refine evidence weighting and recover misclassifications. The work demonstrates a shift from static detectors to interpretable, procedural reasoning that can adapt to rapid evolution in generative models, offering a scalable approach for information integrity and media authenticity.

Abstract

The rapid evolution of AI-generated images poses unprecedented challenges to information integrity and media authenticity. Existing detection approaches suffer from fundamental limitations: traditional classifiers lack interpretability and fail to generalize across evolving generative models, while vision-language models (VLMs), despite their promise, remain constrained to single-shot analysis and pixel-level reasoning. To address these challenges, we introduce AIFo (Agent-based Image Forensics), a novel training-free framework that emulates human forensic investigation through multi-agent collaboration. Unlike conventional methods, our framework employs a set of forensic tools, including reverse image search, metadata extraction, pre-trained classifiers, and VLM analysis, coordinated by specialized LLM-based agents that collect, synthesize, and reason over cross-source evidence. When evidence is conflicting or insufficient, a structured multi-agent debate mechanism allows agents to exchange arguments and reach a reliable conclusion. Furthermore, we enhance the framework with a memory-augmented reasoning module that learns from historical cases to improve future detection accuracy. Our comprehensive evaluation spans 6,000 images across both controlled laboratory settings and challenging real-world scenarios, including images from modern generative platforms and diverse online sources. AIFo achieves 97.05% accuracy, substantially outperforming traditional classifiers and state-of-the-art VLMs. These results demonstrate that agent-based procedural reasoning offers a new paradigm for more robust, interpretable, and adaptable AI-generated image detection.

From Evidence to Verdict: An Agent-Based Forensic Framework for AI-Generated Image Detection

TL;DR

Abstract

From Evidence to Verdict: An Agent-Based Forensic Framework for AI-Generated Image Detection

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (6)