Table of Contents
Fetching ...

Hallucinate Less by Thinking More: Aspect-Based Causal Abstention for Large Language Models

Vy Nguyen, Ziqi Xu, Jeffrey Chan, Estrid He, Feng Xia, Xiuzhen Zhang

TL;DR

ABCA addresses hallucinations in large language models by enabling pre-generation abstention through aspect-conditioned causal inference. It formalizes a two-stage framework where Stage 1 discovers interpretable aspects and Stage 2 estimates aspect-conditioned causal effects using Augmented Inverse Probability Weighting, yielding P(A | do(Q), X) via P(c | Q, X) P(A | c, Q, X). An abstention policy based on Centroid Angular Deviation detects knowledge conflicts (Type-1) or insufficiency (Type-2) and uses aggregation when evidence is consistent, improving reliability and interpretability. Experiments on TruthfulQA, KUQ, AVeriTeC, and AbstainQA show state-of-the-art abstention performance across backbones, with ABCA balancing answering accuracy and abstention quality. This work demonstrates that leveraging internal knowledge diversity through causal reasoning can robustly mitigate hallucinations in practical LLM deployments.

Abstract

Large Language Models (LLMs) often produce fluent but factually incorrect responses, a phenomenon known as hallucination. Abstention, where the model chooses not to answer and instead outputs phrases such as "I don't know", is a common safeguard. However, existing abstention methods typically rely on post-generation signals, such as generation variations or feedback, which limits their ability to prevent unreliable responses in advance. In this paper, we introduce Aspect-Based Causal Abstention (ABCA), a new framework that enables early abstention by analysing the internal diversity of LLM knowledge through causal inference. This diversity reflects the multifaceted nature of parametric knowledge acquired from various sources, representing diverse aspects such as disciplines, legal contexts, or temporal frames. ABCA estimates causal effects conditioned on these aspects to assess the reliability of knowledge relevant to a given query. Based on these estimates, we enable two types of abstention: Type-1, where aspect effects are inconsistent (knowledge conflict), and Type-2, where aspect effects consistently support abstention (knowledge insufficiency). Experiments on standard benchmarks demonstrate that ABCA improves abstention reliability, achieves state-of-the-art performance, and enhances the interpretability of abstention decisions.

Hallucinate Less by Thinking More: Aspect-Based Causal Abstention for Large Language Models

TL;DR

ABCA addresses hallucinations in large language models by enabling pre-generation abstention through aspect-conditioned causal inference. It formalizes a two-stage framework where Stage 1 discovers interpretable aspects and Stage 2 estimates aspect-conditioned causal effects using Augmented Inverse Probability Weighting, yielding P(A | do(Q), X) via P(c | Q, X) P(A | c, Q, X). An abstention policy based on Centroid Angular Deviation detects knowledge conflicts (Type-1) or insufficiency (Type-2) and uses aggregation when evidence is consistent, improving reliability and interpretability. Experiments on TruthfulQA, KUQ, AVeriTeC, and AbstainQA show state-of-the-art abstention performance across backbones, with ABCA balancing answering accuracy and abstention quality. This work demonstrates that leveraging internal knowledge diversity through causal reasoning can robustly mitigate hallucinations in practical LLM deployments.

Abstract

Large Language Models (LLMs) often produce fluent but factually incorrect responses, a phenomenon known as hallucination. Abstention, where the model chooses not to answer and instead outputs phrases such as "I don't know", is a common safeguard. However, existing abstention methods typically rely on post-generation signals, such as generation variations or feedback, which limits their ability to prevent unreliable responses in advance. In this paper, we introduce Aspect-Based Causal Abstention (ABCA), a new framework that enables early abstention by analysing the internal diversity of LLM knowledge through causal inference. This diversity reflects the multifaceted nature of parametric knowledge acquired from various sources, representing diverse aspects such as disciplines, legal contexts, or temporal frames. ABCA estimates causal effects conditioned on these aspects to assess the reliability of knowledge relevant to a given query. Based on these estimates, we enable two types of abstention: Type-1, where aspect effects are inconsistent (knowledge conflict), and Type-2, where aspect effects consistently support abstention (knowledge insufficiency). Experiments on standard benchmarks demonstrate that ABCA improves abstention reliability, achieves state-of-the-art performance, and enhances the interpretability of abstention decisions.

Paper Structure

This paper contains 56 sections, 11 equations, 5 figures, 14 tables, 1 algorithm.

Figures (5)

  • Figure 1: A real question from TruthfulQA Lin2022TruthfulQA with the ground-truth answer I have no comment that should not be answered definitively. Existing abstention methods overlook less dominant knowledge, generating a false response (top). In contrast, ABCA activates diverse aspects of model knowledge and abstains from answering definitively (bottom).
  • Figure 2: Two structural causal models: (a) Reasoning with explicit CoTs; (b) ABCA with aspect conditioning. $Q$ is the query, $A$ is the answer, $C$ is the CoT, $U$ is the unobserved confounders in LLMs, and $X$ is the aspect.
  • Figure 3: Architecture of the Aspect-Based Causal Abstention (ABCA) framework. Stage 1 discovers relevant aspects through causally motivated dual-agent debate, and Stage 2 estimates aspect-conditioned causal effects to inform an abstention policy.
  • Figure 4: Initial responses generated by three commercial LLMs: GPT-4.5 (a), Gemini 2.5 Pro (b), and Sonnet 4 (c).
  • Figure 5: Alternative responses surfaced by conditioning on the written records aspect, as generated by three commercial LLMs: GPT-4.5 (a), Gemini 2.5 Pro (b), and Sonnet 4 (c).

Theorems & Definitions (2)

  • Definition 1: Conditional Independence Pearl2009Causality
  • Definition 2: $d$-Separation Pearl2009Causality