Audit-of-Understanding: Posterior-Constrained Inference for Mathematical Reasoning in Language Models
Samir Abdaljalil, Erchin Serpedin, Khalid Qaraqe, Hasan Kurban
TL;DR
Audit-of-Understanding (AoU) addresses reasoning-induced hallucinations by constraining inference to validated premises. It formalizes posterior-constrained inference: decompose a query into candidate premises $\mathcal{G}$, audit their support to form $\mathcal{G}^+$, and condition the final prediction on $\mathcal{G}^+$. The framework provides theoretical guarantees under perfect validation and excess-risk bounds under imperfect validation, plus tractability analysis. Empirically, AoU yields substantial accuracy and faithfulness gains on GSM8K, MultiArith, and SVAMP across multiple models, without external tools, outperforming several prompting baselines. This approach yields robust, interpretable reasoning traces and opens avenues for extending principled auditing to broader reasoning tasks and tool integrations.
Abstract
Large language models (LLMs) often generate reasoning traces that appear coherent but rest on unsupported assumptions, leading to hallucinated conclusions. Prior work mainly addresses factual hallucinations or relies on post-hoc verification, leaving reasoning-induced hallucinations largely unaddressed. We propose Audit-of-Understanding (AoU), a framework that constrains inference to validated premises through three phases: (1) decomposing a query into candidate assumptions, (2) auditing their support, and (3) conditioning inference only on the validated subset. Formally, AoU is \emph{posterior-constrained inference}, connecting to selective prediction and rejection learning. Our contributions are threefold: (i) theoretical guarantees under perfect validation, (ii) excess-risk bounds under imperfect audits, and (iii) tractability analysis. Empirically, AoU improves both accuracy and faithfulness on GSM8K, MultiArith, and SVAMP, achieving up to +30% gains on GSM8K, +45% on MultiArith, and consistent +20--28% improvements on SVAMP over Chain-of-Thought, Self-Consistency, and CoT-Decoding. Code is available at https://anonymous.4open.science/r/audit-of-understanding-E28B.
