`Generalization is hallucination' through the lens of tensor completions
Liang Ze Wong
TL;DR
The paper proposes a tensor-completion framework to unify generalization and hallucination in language generation, formalizing $n$-dimensional data tensors $D^n$ and their completions $D'$, and defining completion artifacts as high-probability, novel $n$-grams outside observed data. It argues that artifacts arise from partially observed training fibers and rank constraints, and that some artifacts may function as useful generalizations while others constitute hallucinations; the boundary depends on external validation data. The authors provide toy experiments showing artifacts proliferate when model size is reduced and discuss analogies to recommender systems, highlighting a generalization-hallination trade-off. They also outline mitigation strategies that involve expanding the training space to include surrounding contexts and adjusting the loss to penalize unsupported high-prob predictions, while noting substantial limitations and the need for scalable methods to quantify artifacts in large models. Overall, the work offers a theoretical lens to connect compression, overfitting, and generation behavior, suggesting concrete avenues for measuring and mitigating artifacts without neglecting beneficial generalization.
Abstract
In this short position paper, we introduce tensor completions and artifacts and make the case that they are a useful theoretical framework for understanding certain types of hallucinations and generalizations in language models.
