A Philosophical Introduction to Language Models -- Part I: Continuity With Classic Debates
Raphaël Millière, Cameron Buckner
TL;DR
The paper addresses whether large language models instantiate genuine linguistic and cognitive competence or merely mimic sophisticated behavior. It surveys historical foundations and transformer architectures, framing the discussion around core philosophical issues: compositionality, language acquisition, grounding, world representations, and cultural transmission, while warning against misapplied reductive inferences. It finds that LLMs challenge several traditional assumptions and exhibit notable generalization and learning-from-context capabilities, yet robust semantic grounding and stable communicative intentions remain unresolved. The work argues for empirically grounded inquiry into internal representations and world-model-like knowledge, setting the stage for Part II’s empirical probing and new philosophical questions.
Abstract
Large language models like GPT-4 have achieved remarkable proficiency in a broad spectrum of language-based tasks, some of which are traditionally associated with hallmarks of human intelligence. This has prompted ongoing disagreements about the extent to which we can meaningfully ascribe any kind of linguistic or cognitive competence to language models. Such questions have deep philosophical roots, echoing longstanding debates about the status of artificial neural networks as cognitive models. This article -- the first part of two companion papers -- serves both as a primer on language models for philosophers, and as an opinionated survey of their significance in relation to classic debates in the philosophy cognitive science, artificial intelligence, and linguistics. We cover topics such as compositionality, language acquisition, semantic competence, grounding, world models, and the transmission of cultural knowledge. We argue that the success of language models challenges several long-held assumptions about artificial neural networks. However, we also highlight the need for further empirical investigation to better understand their internal mechanisms. This sets the stage for the companion paper (Part II), which turns to novel empirical methods for probing the inner workings of language models, and new philosophical questions prompted by their latest developments.
