Competition Dynamics Shape Algorithmic Phases of In-Context Learning
Core Francisco Park, Ekdeep Singh Lubana, Itamar Pres, Hidenori Tanaka
TL;DR
This work argues that in-context learning (ICL) can be understood as a competition among multiple algorithmic strategies rather than a single capability. By training transformers on a synthetic task that simulates a finite mixture of Markov chains, the authors reproduce core ICL phenomena and identify four distinct algorithms—two retrieval-based (Uni-Ret, Bi-Ret) and two inference-based (Uni-Inf, Bi-Inf)—that compete to drive next-token predictions. They introduce a linear-interpolation framework (LIA) to decompose model outputs into a convex combination of these algorithms, revealing phase diagrams that shift with data diversity, training steps, and architecture, and explaining non-monotonic OOD performance and transience in ICL. The findings offer a unified, mechanistic lens on ICL, with implications for how to design data, models, and training protocols to promote robust, generalizable in-context reasoning rather than memorization or monolithic capabilities.
Abstract
In-Context Learning (ICL) has significantly expanded the general-purpose nature of large language models, allowing them to adapt to novel tasks using merely the inputted context. This has motivated a series of papers that analyze tractable synthetic domains and postulate precise mechanisms that may underlie ICL. However, the use of relatively distinct setups that often lack a sequence modeling nature to them makes it unclear how general the reported insights from such studies are. Motivated by this, we propose a synthetic sequence modeling task that involves learning to simulate a finite mixture of Markov chains. As we show, models trained on this task reproduce most well-known results on ICL, hence offering a unified setting for studying the concept. Building on this setup, we demonstrate we can explain a model's behavior by decomposing it into four broad algorithms that combine a fuzzy retrieval vs. inference approach with either unigram or bigram statistics of the context. These algorithms engage in a competition dynamics to dominate model behavior, with the precise experimental conditions dictating which algorithm ends up superseding others: e.g., we find merely varying context size or amount of training yields (at times sharp) transitions between which algorithm dictates the model behavior, revealing a mechanism that explains the transient nature of ICL. In this sense, we argue ICL is best thought of as a mixture of different algorithms, each with its own peculiarities, instead of a monolithic capability. This also implies that making general claims about ICL that hold universally across all settings may be infeasible.
