Hallucination as a Computational Boundary: A Hierarchy of Inevitability and the Oracle Escape
Wang Xi, Quan Shi, Zenghui Ding, Jianqing Gao, Xianjun Yang
TL;DR
This work formalizes hallucination as an intrinsic limit when LLMs are modeled as probabilistic Turing machines, introducing a three-boundary hierarchy (Diagonalization, Uncomputability, Information-Theoretic) and two escape strategies: external Oracle augmentation (RAG) and internal continual learning (CL). It develops a neuro-game-theoretic, CLS-inspired framework for adaptive escape, proves amortized cost advantages for continual learning, and introduces Computational Class Alignment (CCA) as a safety principle. The experimental validation with a RAG-CL hybrid demonstrates robust accuracy with cost benefits, supporting dynamic alignment of task complexity to the agent's computational class. Overall, the paper shifts safety design from single-model fixes to ecosystemwide, boundary-aware strategies that adaptively elevate internal capabilities.
Abstract
The illusion phenomenon of large language models (LLMs) is the core obstacle to their reliable deployment. This article formalizes the large language model as a probabilistic Turing machine by constructing a "computational necessity hierarchy", and for the first time proves the illusions are inevitable on diagonalization, incomputability, and information theory boundaries supported by the new "learner pump lemma". However, we propose two "escape routes": one is to model Retrieval Enhanced Generations (RAGs) as oracle machines, proving their absolute escape through "computational jumps", providing the first formal theory for the effectiveness of RAGs; The second is to formalize continuous learning as an "internalized oracle" mechanism and implement this path through a novel neural game theory framework. Finally, this article proposes a feasible new principle for artificial intelligence security - Computational Class Alignment (CCA), which requires strict matching between task complexity and the actual computing power of the system, providing theoretical support for the secure application of artificial intelligence.
