Unveiling Attractor Cycles in Large Language Models: A Dynamical Systems View of Successive Paraphrasing
Zhilin Wang, Yafu Li, Jianhao Yan, Yu Cheng, Yue Zhang
TL;DR
This work treats successive paraphrasing by large language models as a discrete dynamical system, modeling the process as $T_{n+1}=P(T_n)$ over a finite text space $\mathcal{T}$. It empirically uncovers robust 2-period attractor cycles across multiple models and languages, showing that generation converges to a limited, oscillatory set of paraphrases rather than exploring vast linguistic variation. The study demonstrates that increasing randomness or alternating prompts only modestly disrupts these cycles, with invertibility of paraphrase tasks further supporting attractor formation. It then proposes perturbations, history-aware transformations, and perplexity-based sampling to mitigate small cycles, and validates a data-augmentation benefit when these strategies suppress attractors, offering a dynamical-systems perspective on LLM expressivity and avenues for expanding generative diversity.
Abstract
Dynamical systems theory provides a framework for analyzing iterative processes and evolution over time. Within such systems, repetitive transformations can lead to stable configurations, known as attractors, including fixed points and limit cycles. Applying this perspective to large language models (LLMs), which iteratively map input text to output text, provides a principled approach to characterizing long-term behaviors. Successive paraphrasing serves as a compelling testbed for exploring such dynamics, as paraphrases re-express the same underlying meaning with linguistic variation. Although LLMs are expected to explore a diverse set of paraphrases in the text space, our study reveals that successive paraphrasing converges to stable periodic states, such as 2-period attractor cycles, limiting linguistic diversity. This phenomenon is attributed to the self-reinforcing nature of LLMs, as they iteratively favour and amplify certain textual forms over others. This pattern persists with increasing generation randomness or alternating prompts and LLMs. These findings underscore inherent constraints in LLM generative capability, while offering a novel dynamical systems perspective for studying their expressive potential.
