Drift No More? Context Equilibria in Multi-Turn LLM Interactions
Vardhan Dongre, Ryan A. Rossi, Viet Dac Lai, David Seunghyun Yoon, Dilek Hakkani-Tür, Trung Bui
TL;DR
This work addresses how context drift unfolds in sustained multi-turn LLM interactions. It introduces a simple, principled dynamical framework that treats drift as a bounded stochastic recurrence, quantified by the turn-wise KL divergence $D_t = D_{\mathrm{KL}}(q_t \;||\; p_t)$ between a test model and a goal-consistent reference. Through synthetic tasks and realistic $\tau$-Bench simulations, the authors show that drift stabilizes at finite equilibria and that lightweight reminder interventions reliably lower the equilibrium divergence in line with the proposed model. The results suggest that multi-turn drift is a controllable equilibrium phenomenon rather than an inexorable decay, with practical implications for designing robust, long-horizon interactive AI systems.
Abstract
Large Language Models (LLMs) excel at single-turn tasks such as instruction following and summarization, yet real-world deployments require sustained multi-turn interactions where user goals and conversational context persist and evolve. A recurring challenge in this setting is context drift: the gradual divergence of a model's outputs from goal-consistent behavior across turns. Unlike single-turn errors, drift unfolds temporally and is poorly captured by static evaluation metrics. In this work, we present a study of context drift in multi-turn interactions and propose a simple dynamical framework to interpret its behavior. We formalize drift as the turn-wise KL divergence between the token-level predictive distributions of the test model and a goal-consistent reference model, and propose a recurrence model that interprets its evolution as a bounded stochastic process with restoring forces and controllable interventions. We instantiate this framework in both synthetic long-horizon rewriting tasks and realistic user-agent simulations such as in $τ$-Bench, measuring drift for several open-weight LLMs that are used as user simulators. Our experiments consistently reveal stable, noise-limited equilibria rather than runaway degradation, and demonstrate that simple reminder interventions reliably reduce divergence in line with theoretical predictions. Together, these results suggest that multi-turn drift can be understood as a controllable equilibrium phenomenon rather than as inevitable decay, providing a foundation for studying and mitigating context drift in extended interactions.
