Teaching Small Language Models to Learn Logic through Meta-Learning
Leonardo Bertolazzi, Manuel Vargas Guzmán, Raffaella Bernardi, Maciej Malicki, Jakub Szymanik
TL;DR
The paper tackles the challenge of out-of-distribution logical generalization in large language models by applying few-shot meta-learning to syllogistic reasoning, a well-defined logical fragment. It constructs synthetic knowledge bases and frames premise selection as selecting the minimal set of premises that entail a query, training small autoregressive models to extract abstract inference patterns across tasks. Empirical results show that 1.5B–7B Qwen models fine-tuned with meta-learning achieve strong generalization, often outperforming GPT-4o and o3-mini on the syllogistic task, particularly in low-data regimes. The findings suggest that meta-learning can enhance deductive reasoning in small LMs and provide a pathway toward more robust, abstract reasoning capabilities in practical AI systems.
Abstract
Large language models (LLMs) are increasingly evaluated on reasoning tasks, yet their logical abilities remain contested. To address this, we study LLMs' reasoning in a well-defined fragment of logic: syllogistic reasoning. We cast the problem as premise selection and construct controlled datasets to isolate logical competence. Beyond evaluation, an open challenge is enabling LLMs to acquire abstract inference patterns that generalize to novel structures. We propose to apply few-shot meta-learning to this domain, thereby encouraging models to extract rules across tasks rather than memorize patterns within tasks. Although meta-learning has been little explored in the context of logic learnability, our experiments show that it is effective: small models (1.5B-7B) fine-tuned with meta-learning demonstrate strong gains in generalization, with especially pronounced benefits in low-data regimes. These meta-learned models outperform GPT-4o and o3-mini on our syllogistic reasoning task.
