Mitigating Hallucination in Multimodal Reasoning via Functional Attention Control
Haolang Lu, Bolun Chu, WeiYe Fu, Guoshun Nan, Junning Liu, Minghui Pan, Qiankun Li, Yi Yu, Hua Wang, Kun Wang
TL;DR
The paper addresses hallucination in multimodal reasoning models by identifying a perception–then–reasoning dynamic and two root failure modes: perceptual bias in early layers and reasoning drift in deeper layers. It introduces a lightweight two-step plugin—Functional Head Identification and Class-conditioned Rescaling—that identifies perception- and reasoning-oriented attention heads and selectively amplifies their contributions without retraining. Across three real-world MLRMs and six benchmarks, the approach yields consistent performance gains with minimal computational overhead, delivering balanced improvements on both perception- and reasoning-heavy tasks and improving interpretability. The method is model-agnostic and plug-and-play, offering a practical path toward safer, more reliable multimodal reasoning in high-stakes applications.
Abstract
Multimodal large reasoning models (MLRMs) are rapidly advancing vision-language reasoning and are emerging as a foundation for cross-modal intelligence. Hallucination remains a persistent failure mode, manifesting itself as erroneous reasoning chains and misinterpretation of visual content. In this study, we observe that attention heads exhibit a staged division: shallow heads predominantly serve perception, while deeper heads shift toward symbolic reasoning, revealing two major causes of hallucination, namely perceptual bias and reasoning drift. To address these issues, we propose a lightweight and interpretable two-step plugin, Functional Head Identification and Class-conditioned Rescaling, which locates perception- and reasoning-oriented heads and regulates their contributions without retraining. Evaluations on three real-world MLRMs (Kimi-VL, Ocean-R1, R1-Onevision), six benchmarks across three domains, and four baselines show that our plugin achieves an average improvement of 5% and up to 15%, with only <1% additional computation and 9% of baseline latency. Our approach is completely model-agnostic and significantly enhances both the reliability and interpretability of the off-the-shelf MLRMs, thereby enabling their safe deployment in high-stakes applications. Our code is available at https://anonymous.4open.science/r/Functional-Attention-Control.
