Child vs. machine language learning: Can the logical structure of human language unleash LLMs?
Uli Sauerland, Celia Matthaei, Felix Salfner
TL;DR
The paper argues that human language learning is guided by intrinsic logical structures, which contrast with current LLM training that lacks bias toward algebraic relationships like negation. It predicts that certain language properties, such as German default plural formation, will reveal suboptimal generalization in LLMs when these logical connections are not explicitly encoded. Through a nonce-noun plural task adapted from Marcus et al., it demonstrates that seven LLMs largely fail to generalize the default plural $-s$ in German, highlighting systematic gaps in how LLMs capture morpho-logic. The authors advocate cross-disciplinary efforts and curriculum-like training strategies to inject linguistic logic into LLMs, suggesting that such approaches could improve generalization, efficiency, and linguistic competence in AI systems.
Abstract
We argue that human language learning proceeds in a manner that is different in nature from current approaches to training LLMs, predicting a difference in learning biases. We then present evidence from German plural formation by LLMs that confirm our hypothesis that even very powerful implementations produce results that miss aspects of the logic inherent to language that humans have no problem with. We conclude that attention to the different structures of human language and artificial neural networks is likely to be an avenue to improve LLM performance.
