Filtered Corpus Training (FiCT) Shows that Language Models can Generalize from Indirect Evidence
Abhinav Patil, Jaap Jumelet, Yu Ying Chiu, Andy Lapastora, Peter Shen, Lexie Wang, Clevis Willrich, Shane Steinert-Threlkeld
TL;DR
This work introduces Filtered Corpus Training (FiCT), a causal ablation approach to measure linguistic generalization by training language models on corpora from which specific constructions are filtered out. By applying FiCT to LSTMs and Transformer models and evaluating with BLiMP using metrics like $acc\Delta$ and $P\Delta$, the authors demonstrate that both architectures can generalize from indirect evidence even when direct examples are removed, though Transformers achieve better perplexity. A key finding is the dissociation between perplexity and linguistic generalization, implying that lower perplexity does not imply stronger grammatical generalization. The results provide evidence against purely memorization-based learning in LMs and offer a framework for more fine-grained evaluation of linguistic competence, with implications for model development and evaluation practices.
Abstract
This paper introduces Filtered Corpus Training, a method that trains language models (LMs) on corpora with certain linguistic constructions filtered out from the training data, and uses it to measure the ability of LMs to perform linguistic generalization on the basis of indirect evidence. We apply the method to both LSTM and Transformer LMs (of roughly comparable size), developing filtered corpora that target a wide range of linguistic phenomena. Our results show that while transformers are better qua LMs (as measured by perplexity), both models perform equally and surprisingly well on linguistic generalization measures, suggesting that they are capable of generalizing from indirect evidence.
