Distributional reasoning in LLMs: Parallel reasoning processes in multi-hop reasoning
Yuval Shalev, Amir Feder, Ariel Goldstein
TL;DR
This work investigates whether large language models perform distributional, parallel multi-hop reasoning rather than a single-step inference. It introduces a two-stage, linear-approximation framework in which middle-layer intermediate-answer activations (A1) linearly predict final-answer activations (A2) via a subject-invariant matrix Q2, and demonstrates interpretable, phase-transition dynamics in hidden embeddings. Using the Compositional Celebrities dataset and a Hallucinations variant, the authors show that after two-thirds depth the A1-to-A2 relationship is strong (mean R^2 > 0.5), that intermediate activations are interpretable and aligned with final outputs, and that the same reasoning process generalizes to fictitious or out-of-distribution subjects. These findings suggest a cognitive-inspired blend of association and propositional reasoning in LLMs and provide a framework for probing internal thought processes. The results advance cognitive modeling of AI by linking activations to parallel reasoning paths and offering interpretable diagnostics for reasoning strategies.
Abstract
Large language models (LLMs) have shown an impressive ability to perform tasks believed to require thought processes. When the model does not document an explicit thought process, it becomes difficult to understand the processes occurring within its hidden layers and to determine if these processes can be referred to as reasoning. We introduce a novel and interpretable analysis of internal multi-hop reasoning processes in LLMs. We demonstrate that the prediction process for compositional reasoning questions can be modeled using a simple linear transformation between two semantic category spaces. We show that during inference, the middle layers of the network generate highly interpretable embeddings that represent a set of potential intermediate answers for the multi-hop question. We use statistical analyses to show that a corresponding subset of tokens is activated in the model's output, implying the existence of parallel reasoning paths. These observations hold true even when the model lacks the necessary knowledge to solve the task. Our findings can help uncover the strategies that LLMs use to solve reasoning tasks, offering insights into the types of thought processes that can emerge from artificial intelligence. Finally, we also discuss the implication of cognitive modeling of these results.
