Cognitive Inception: Agentic Reasoning against Visual Deceptions by Injecting Skepticism
Yinjie Zhao, Heng Zhao, Bihan Wen, Joey Tianyi Zhou
TL;DR
This work tackles the vulnerability of multi-modal LLMs to AIGC visual deceptions by introducing Inception, a fully reasoning-based agentic framework that injects skepticism into the inference process. It models skepticism as iterative, dual-agent reasoning between External and Internal Skeptics, coupled with a logic-verification mechanism that treats valid skeptical logics as evidence of AI-generated content when sufficient conditions are found. The approach achieves state-of-the-art results on the AEGIS benchmark and strong gains on Forensics-Bench, demonstrating robust generalization across video and image modalities without feature-based detectors. By formalizing skeptical logics, their verification, and iterative expansion, the method improves authenticity verification while providing interpretable reasoning traces, signaling practical impact for trustworthy AI perception. The framework’s reliance on reasoning dynamics rather than training on new detectors suggests broad applicability to emerging generative visual threats and evolving data distributions.
Abstract
As the development of AI-generated contents (AIGC), multi-modal Large Language Models (LLM) struggle to identify generated visual inputs from real ones. Such shortcoming causes vulnerability against visual deceptions, where the models are deceived by generated contents, and the reliability of reasoning processes is jeopardized. Therefore, facing rapidly emerging generative models and diverse data distribution, it is of vital importance to improve LLMs' generalizable reasoning to verify the authenticity of visual inputs against potential deceptions. Inspired by human cognitive processes, we discovered that LLMs exhibit tendency of over-trusting the visual inputs, while injecting skepticism could significantly improve the models visual cognitive capability against visual deceptions. Based on this discovery, we propose \textbf{Inception}, a fully reasoning-based agentic reasoning framework to conduct generalizable authenticity verification by injecting skepticism, where LLMs' reasoning logic is iteratively enhanced between External Skeptic and Internal Skeptic agents. To the best of our knowledge, this is the first fully reasoning-based framework against AIGC visual deceptions. Our approach achieved a large margin of performance improvement over the strongest existing LLM baselines and SOTA performance on AEGIS benchmark.
