Emergence of a High-Dimensional Abstraction Phase in Language Transformers
Emily Cheng, Diego Doimo, Corentin Kervadec, Iuri Macocco, Jade Yu, Alessandro Laio, Marco Baroni
TL;DR
This work investigates how transformer language models organize linguistic information by analyzing the evolving intrinsic dimension (ID) of layer representations using GRIDE across five LMs and three corpora. It introduces the Information Imbalance Delta to study the neighborhood structure of representations and cross-model similarities. A central high-ID phase appears in intermediate layers, marking a transition to abstract syntactic and semantic processing and predicting downstream transfer performance. The findings reveal cross-model geometric convergence at the ID peak and have practical implications for layer-wise pruning, fine-tuning, and model interfacing, suggesting that core linguistic processing concentrates in a distinct mid-layer phase.
Abstract
A language model (LM) is a mapping from a linguistic context to an output token. However, much remains to be known about this mapping, including how its geometric properties relate to its function. We take a high-level geometric approach to its analysis, observing, across five pre-trained transformer-based LMs and three input datasets, a distinct phase characterized by high intrinsic dimensionality. During this phase, representations (1) correspond to the first full linguistic abstraction of the input; (2) are the first to viably transfer to downstream tasks; (3) predict each other across different LMs. Moreover, we find that an earlier onset of the phase strongly predicts better language modelling performance. In short, our results suggest that a central high-dimensionality phase underlies core linguistic processing in many common LM architectures.
