Confabulation: The Surprising Value of Large Language Model Hallucinations

Peiqi Sui; Eamon Duede; Sophie Wu; Richard Jean So

Confabulation: The Surprising Value of Large Language Model Hallucinations

Peiqi Sui, Eamon Duede, Sophie Wu, Richard Jean So

TL;DR

It is argued and empirically demonstrate that measurable semantic characteristics of LLM confabulations mirror a human propensity to utilize increased narrativity as a cognitive resource for sense-making and communication, and suggests that the tendency for LLMs to confabulate may be intimately associated with a positive capacity for coherent narrative-text generation.

Abstract

This paper presents a systematic defense of large language model (LLM) hallucinations or 'confabulations' as a potential resource instead of a categorically negative pitfall. The standard view is that confabulations are inherently problematic and AI research should eliminate this flaw. In this paper, we argue and empirically demonstrate that measurable semantic characteristics of LLM confabulations mirror a human propensity to utilize increased narrativity as a cognitive resource for sense-making and communication. In other words, it has potential value. Specifically, we analyze popular hallucination benchmarks and reveal that hallucinated outputs display increased levels of narrativity and semantic coherence relative to veridical outputs. This finding reveals a tension in our usually dismissive understandings of confabulation. It suggests, counter-intuitively, that the tendency for LLMs to confabulate may be intimately associated with a positive capacity for coherent narrative-text generation.

Confabulation: The Surprising Value of Large Language Model Hallucinations

TL;DR

Abstract

Paper Structure (14 sections, 1 figure, 4 tables)

This paper contains 14 sections, 1 figure, 4 tables.

Background
Related Work
Confabulation vs Hallucination
Towards a Narrative-Centered Definition of Confabulation
Data, Methods, and Results
Empirical Results for Higher Narrativity in Hallucinations
In defense of confabulation
Empirical Support for Association of Narrativity and Coherence in Confabulated Texts
Narrative, Discourse, and Coherence
Narratives Help Us Articulate and Understand Complex Arguments
Narratives Maintain the Consistency of Our Own Internal World Models
Narratives Enable Patients to Negotiate the Coherence of Their Experiences
Limitations and Directions for Future Research
Acknowledgement

Figures (1)

Figure 1: The left panel illustrates distribution for narrative score of hallucinated outputs (blue) and the edited version of the output (gray) in the FaithDial dataset. The hallucinated texts are, in general more narrative rich than those that are edited to resolve inaccuracies. The right panel illustrates distribution for non-hallucinated texts from the FaithDial dataset.

Confabulation: The Surprising Value of Large Language Model Hallucinations

TL;DR

Abstract

Confabulation: The Surprising Value of Large Language Model Hallucinations

Authors

TL;DR

Abstract

Table of Contents

Figures (1)