How do language models learn facts? Dynamics, curricula and hallucinations

Nicolas Zucchet; Jörg Bornschein; Stephanie Chan; Andrew Lampinen; Razvan Pascanu; Soham De

TL;DR

The paper investigates how language models acquire factual knowledge using a synthetic biographies task to disentangle knowledge from memorization. It identifies a three-phase learning dynamic, with a plateau during which attention-based recall circuits form, and shows that data distribution and curricula critically shape learning speed and final knowledge. It also reveals that hallucinations emerge alongside knowledge and that fine-tuning often erases prior knowledge, highlighting data-centric strategies as promising avenues to accelerate training and improve robustness. The results motivate data scheduling and curriculum-like approaches to pre-training, and provide mechanistic hypotheses for future validation on larger, more realistic models.

Abstract

Large language models accumulate vast knowledge during pre-training, yet the dynamics governing this acquisition remain poorly understood. This work investigates the learning dynamics of language models on a synthetic factual recall task, uncovering three key findings. First, language models learn in three phases, exhibiting a performance plateau before acquiring precise factual knowledge. Mechanistically, this plateau coincides with the formation of attention-based circuits that support recall. Second, the training data distribution significantly impacts learning dynamics, as imbalanced distributions lead to shorter plateaus. Finally, hallucinations emerge simultaneously with knowledge, and integrating new knowledge into the model through fine-tuning is challenging, because fine-tuning quickly corrupts the model's existing parametric memories. Our results emphasize the importance of data distribution in knowledge acquisition and suggest novel data scheduling strategies to accelerate neural network training.
