Limits for Learning with Language Models
Nicholas Asher, Swarnadeep Bhar, Akshay Chaturvedi, Julie Hunter, Soumya Paul
TL;DR
This paper establishes a theoretical limit on what language models can learn about linguistic meaning by integrating continuation semantics with formal semantics and the Borel hierarchy. It shows that universal quantification, entailment, and related semantic concepts fall outside the effective learnability of LLMs under standard inductive learning assumptions, constraining them to learn only certain basic (\Delta^0_1) sets. The authors formalize LLMs as distributions over continuations and connect learnability to statistical learning theory, ultimately arguing that higher-order semantic notions cannot be guaranteed by current architectures alone. They bolster the theory with empirical studies demonstrating quantification and order-related reasoning challenges in prevalent models like BERT, RoBERTa, GPT-3.5, and ChatGPT. The work implies that robust linguistic understanding requires integrating explicit linguistic structure into model design and training, rather than relying solely on scale, and it highlights important ethical considerations around model reliability in reasoning tasks.
Abstract
With the advent of large language models (LLMs), the trend in NLP has been to train LLMs on vast amounts of data to solve diverse language understanding and generation tasks. The list of LLM successes is long and varied. Nevertheless, several recent papers provide empirical evidence that LLMs fail to capture important aspects of linguistic meaning. Focusing on universal quantification, we provide a theoretical foundation for these empirical findings by proving that LLMs cannot learn certain fundamental semantic properties including semantic entailment and consistency as they are defined in formal semantics. More generally, we show that LLMs are unable to learn concepts beyond the first level of the Borel Hierarchy, which imposes severe limits on the ability of LMs, both large and small, to capture many aspects of linguistic meaning. This means that LLMs will continue to operate without formal guarantees on tasks that require entailments and deep linguistic understanding.
