Aspects of human memory and Large Language Models
Romuald A. Janik
TL;DR
The paper investigates whether Large Language Models exhibit memory properties akin to human memory by viewing memory as a functional aspect of the probabilistic next-token model $P(\text{token}|\text{preceding text})$. Through experiments that adapt serial-memory tasks to LLMs (using has-a, is-a, lives-in facts) and evaluating recall via noun probability rankings, the authors show human-like primacy and recency effects, memory enhancement from elaborations, and forgetting dominated by interference, with a notable LLM-specific memory formation time. The findings suggest these memory-like behaviors emerge from the statistics of the training data rather than an explicit memory subsystem, supporting the view that linguistic structure and human memory imprint one another. This has implications for cognitive science and AI, indicating a close interplay between biological memory effects and statistical language structure in shaping narrative coherence and memory phenomena.
Abstract
Large Language Models (LLMs) are huge artificial neural networks which primarily serve to generate text, but also provide a very sophisticated probabilistic model of language use. Since generating a semantically consistent text requires a form of effective memory, we investigate the memory properties of LLMs and find surprising similarities with key characteristics of human memory. We argue that the human-like memory properties of the Large Language Model do not follow automatically from the LLM architecture but are rather learned from the statistics of the training textual data. These results strongly suggest that the biological features of human memory leave an imprint on the way that we structure our textual narratives.
