FaithAct: Faithfulness Planning and Acting in MLLMs
Junxian Li, Xinyue Xu, Sai Ma, Di Zhang, Seth Lazar, Sichao Li
TL;DR
The paper tackles unfaithfulness in multimodal large language models (MLLMs) by distinguishing perceptual faithfulness (PF) from behavioral faithfulness (BF) and introducing FaithEval to quantify step- and chain-level grounding. It then presents FaithAct, a faithfulness-first planning framework that verifies evidential grounding at every reasoning step via a lightweight API (Poll, Ground, Select, Abstain, Count) and employs a refine-based loop to ensure decisions are grounded before proceeding. Empirical results on RealWorldQA and MMHal show FaithAct improves perceptual faithfulness by up to up to $26\%$ without compromising task accuracy and reduces object hallucination, outperforming prompt-based and tool-augmented baselines. The work provides a unified framework for evaluating and enforcing faithfulness in multimodal reasoning, with implications for more trustworthy and interpretable MLLMs.
Abstract
Unfaithfulness remains a persistent challenge for large language models (LLMs), which often produce plausible yet ungrounded reasoning chains that diverge from perceptual evidence or final conclusions. We distinguish between behavioral faithfulness (alignment between reasoning and output) and perceptual faithfulness (alignment between reasoning and input), and introduce FaithEval for quantifying step-level and chain-level faithfulness by evaluating whether each claimed object is visually supported by the image. Building on these insights, we propose FaithAct, a faithfulness-first planning and acting framework that enforces evidential grounding at every reasoning step. Experiments across multiple reasoning benchmarks demonstrate that FaithAct improves perceptual faithfulness by up to 26% without degrading task accuracy compared to prompt-based and tool-augmented baselines. Our analysis shows that treating faithfulness as a guiding principle not only mitigates hallucination but also leads to more stable reasoning trajectories. This work thereby establishes a unified framework for both evaluating and enforcing faithfulness in multimodal reasoning.
