InEx: Hallucination Mitigation via Introspection and Cross-Modal Multi-Agent Collaboration
Zhongyu Yang, Yingfang Yuan, Xuanming Jiang, Baoyi An, Wei Pang
TL;DR
This work tackles hallucination in multimodal LLMs by introducing InEx, a training-free framework that blends internal introspective reasoning guided by TVER with external cross-modal collaboration involving textual and visual self-reflection and an image-editing agent. The approach iteratively verifies and refines responses until cross-modal consensus is achieved, underpinned by theoretical results linking information flow and the information bottleneck. Empirical results across hallucination and general-purpose benchmarks show that InEx consistently outperforms strong baselines, with robust ablations and significance testing supporting the contributions. The method demonstrates a path toward autonomous, reliable multimodal reasoning without model retraining, with potential extension to additional modalities.
Abstract
Hallucination remains a critical challenge in large language models (LLMs), hindering the development of reliable multimodal LLMs (MLLMs). Existing solutions often rely on human intervention or underutilize the agent's ability to autonomously mitigate hallucination. To address these limitations, we draw inspiration from how humans make reliable decisions in the real world. They begin with introspective reasoning to reduce uncertainty and form an initial judgment, then rely on external verification from diverse perspectives to reach a final decision. Motivated by this cognitive paradigm, we propose InEx, a training-free, multi-agent framework designed to autonomously mitigate hallucination. InEx introduces internal introspective reasoning, guided by entropy-based uncertainty estimation, to improve the reliability of the decision agent's reasoning process. The agent first generates a response, which is then iteratively verified and refined through external cross-modal multi-agent collaboration with the editing agent and self-reflection agents, further enhancing reliability and mitigating hallucination. Extensive experiments show that InEx consistently outperforms existing methods, achieving 4%-27% gains on general and hallucination benchmarks, and demonstrating strong robustness.
