Why LLMs Cannot Think and How to Fix It
Marius Jahrens, Thomas Martinetz
TL;DR
The paper tackles the question of whether LLMs can truly think by formalizing a notion of thought and showing that current decoder-only, text-space training cannot realize genuine internal deliberation. It argues that two design choices—deterministic model state and population-level training—prevent thoughts from collapsing into instance-specific decisions within the feature space, offering a theoretical framework built around a conditional mutual information formulation $I(State_t; Y_t | X_{1, dots,t})$. The authors propose architectural and training changes that inject non-determinism and align internal decisions with conversation instances, suggesting randomness injections and targeted fine-tuning (e.g., ORPO-like strategies) to enable feature-space thought processes. They discuss potential benefits, including improved reasoning and planning, reduced symmetry-related inefficiencies, and possible latency gains through speculative decoding, while acknowledging limitations such as depth-bound reasoning and the need for recurrence or parallelism. This work lays groundwork for empirical validation and guides future research toward enabling true internal deliberation in LLMs with practical implications for multimodal and embodied AI.
Abstract
This paper elucidates that current state-of-the-art Large Language Models (LLMs) are fundamentally incapable of making decisions or developing "thoughts" within the feature space due to their architectural constraints. We establish a definition of "thought" that encompasses traditional understandings of that term and adapt it for application to LLMs. We demonstrate that the architectural design and language modeling training methodology of contemporary LLMs inherently preclude them from engaging in genuine thought processes. Our primary focus is on this theoretical realization rather than practical insights derived from experimental data. Finally, we propose solutions to enable thought processes within the feature space and discuss the broader implications of these architectural modifications.
