Grounding the Ungrounded: A Spectral-Graph Framework for Quantifying Hallucinations in Multimodal LLMs
Supratik Sarkar, Swagatam Das
TL;DR
The paper introduces a principled spectral-graph diffusion framework to quantify hallucinations in multimodal LLMs by embedding outputs on a multimodal graph Laplacian within an RKHS. It defines a KL-calibrated, smoothed semantic distortion score and an energy-based hallucination measure that decompose across modalities, with time-indexed diffusion and temperature parameters. The core contributions include a multimodal Laplacian, spectral decomposition, Courant–Fischer bounds, and calibration strategies that bound hallucination energy and reveal its modal origins. Empirical validation across COCO, VQAv2, and AudioCaps with multiple inference stacks demonstrates improved detection performance and interpretable energy dynamics, providing a principled basis for evaluation and potential mitigation.
Abstract
Hallucinations in LLMs--especially in multimodal settings--undermine reliability. We present a rigorous information-geometric framework, grounded in diffusion dynamics, to quantify hallucinations in MLLMs where model outputs are embedded via spectral decompositions of multimodal graph Laplacians, and their gaps to a truth manifold define a semantic distortion metric. We derive Courant-Fischer bounds on a temperature-dependent hallucination profile and use RKHS eigenmodes to obtain modality-aware, interpretable measures that track evolution over prompts and time. This reframes hallucination as quantifiable and bounded, providing a principled basis for evaluation and mitigation.
