Shared Imagination: LLMs Hallucinate Alike
Yilun Zhou, Caiming Xiong, Silvio Savarese, Chien-Sheng Wu
TL;DR
The paper introduces Imaginary Question Answering (IQA) as a probe of cross-model similarity among large language models, revealing a strong shared imagination space where models reliably answer purely fictional questions across families (GPT, Claude, Mistral, Llama 3). By prompting 13 models to generate direct or context-based imaginary questions and evaluating answers from potentially different AMs, the study finds substantial non-random correctness, especially for context-based questions (average κ ≈ 54% for DQs and ≈ 86% for CQs, with peaks up to 96%), suggesting fundamental commonalities in their imaginative inferencing. Extensive analyses show the phenomenon persists across topics, is partially explained by data characteristics and generation order but not by simple heuristics like perplexity, and is influenced by factors such as prompt ordering and question length. The work discusses implications for model homogeneity, hallucination detection, and computational creativity, and points to future work including broader model families, mechanistic interpretability, and alternative reasoning prompts to further explore the shared-imagination phenomenon.
Abstract
Despite the recent proliferation of large language models (LLMs), their training recipes -- model architecture, pre-training data and optimization algorithm -- are often very similar. This naturally raises the question of the similarity among the resulting models. In this paper, we propose a novel setting, imaginary question answering (IQA), to better understand model similarity. In IQA, we ask one model to generate purely imaginary questions (e.g., on completely made-up concepts in physics) and prompt another model to answer. Surprisingly, despite the total fictionality of these questions, all models can answer each other's questions with remarkable success, suggesting a "shared imagination space" in which these models operate during such hallucinations. We conduct a series of investigations into this phenomenon and discuss implications on model homogeneity, hallucination, and computational creativity.
