Biasless Language Models Learn Unnaturally: How LLMs Fail to Distinguish the Possible from the Impossible
Imry Ziv, Nur Lan, Emmanuel Chemla, Roni Katzir
TL;DR
This study investigates whether large language models (LLMs) inherently distinguish between humanly possible languages and their impossible perturbations. By extending previous English-focused analyses to eight languages and multiple perturbations, and training GPT-2 from scratch on baseline and perturbed datasets, the authors compare perplexity-based learning curves under intralinguistic and interlinguistic frameworks. They find that GPT-2 often learns possible and impossible variants with similar ease and show no systematic separation between attested and unattested language sets, challenging the notion that LLMs encode human-like innate linguistic biases. The results suggest that perplexity-based metrics may not capture the innate biases that shape human linguistic typology, highlighting a fundamental mismatch between LLM learning dynamics and human language cognition. Limitations include the scope of perturbations and model size, indicating the need for broader empirical coverage across architectures and languages.
Abstract
Are large language models (LLMs) sensitive to the distinction between humanly possible languages and humanly impossible languages? This question is taken by many to bear on whether LLMs and humans share the same innate learning biases. Previous work has attempted to answer it in the positive by comparing LLM learning curves on existing language datasets and on "impossible" datasets derived from them via various perturbation functions. Using the same methodology, we examine this claim on a wider set of languages and impossible perturbations. We find that in most cases, GPT-2 learns each language and its impossible counterpart equally easily, in contrast to previous claims. We also apply a more lenient condition by testing whether GPT-2 provides any kind of separation between the whole set of natural languages and the whole set of impossible languages. By considering cross-linguistic variance in various metrics computed on the perplexity curves, we show that GPT-2 provides no systematic separation between the possible and the impossible. Taken together, these perspectives show that LLMs do not share the human innate biases that shape linguistic typology.
