What We Talk About When We Talk About LMs: Implicit Paradigm Shifts and the Ship of Language Models
Shengqi Zhu, Jeffrey M. Rzeszotarski
TL;DR
The paper investigates how the term Language Models (LMs) functions as a time-variant referent, akin to a Ship of Theseus, across NLP literature. It builds a data infrastructure from 7,650 papers in ACL, EMNLP, and NAACL (2020–2023), introducing two keyword sets: $oldsymbol{\mathcal{L}}$ for collective LM mentions and $oldsymbol{\mathcal{M}}$ for specific models, with model names automatically detected via a GPT-4-turbo workflow and manually validated to yield 103 models and 155 aliases. Through quantitative analyses of $N^{\mathcal{L}}$, $N_m$, and related metrics, the study reveals a dramatic rise in LM discourse post-2021, a shift in referents from older archetypes (e.g., BERT) toward newer generative models (e.g., GPT-family, LLaMA, T5 variants), and a decreasing cross-conference similarity in model compositions. The findings highlight how a stable, high-level term can mask substantial changes in its concrete referents, underscoring the need for fine-grained, diachronic analyses to interpret scientific progress and guide long-term knowledge management in fast-evolving fields.
Abstract
The term Language Models (LMs) as a time-specific collection of models of interest is constantly reinvented, with its referents updated much like the $\textit{Ship of Theseus}$ replaces its parts but remains the same ship in essence. In this paper, we investigate this $\textit{Ship of Language Models}$ problem, wherein scientific evolution takes the form of continuous, implicit retrofits of key existing terms. We seek to initiate a novel perspective of scientific progress, in addition to the more well-studied emergence of new terms. To this end, we construct the data infrastructure based on recent NLP publications. Then, we perform a series of text-based analyses toward a detailed, quantitative understanding of the use of Language Models as a term of art. Our work highlights how systems and theories influence each other in scientific discourse, and we call for attention to the transformation of this Ship that we all are contributing to.
