Exploring Continual Learning of Compositional Generalization in NLI
Xiyan Fu, Anette Frank
TL;DR
The paper introduces C2Gen NLI, a continual-learning framework to study how models acquire primitive inferences over time and how this sequential learning affects compositional generalization in natural language inference. It shows that continual learning induces forgetting, hindering unseen compositional inferences, but that memorization-based strategies and carefully ordered learning (e.g., easy-before-hard, dependency-aware primitive ordering) can mitigate forgetting and improve generalization. Through offline CGen and continual C2Gen experiments, plus analyses of primitive recognition versus compositional inference, the work demonstrates when and how continual learning helps, and where it still falls short compared to non-continual training. The findings have practical implications for dynamic, knowledge-updating applications (e.g., Persona Dialogue) and point to future directions in learning-order optimization and curriculum-inspired approaches to enhance continual compositional generalization in NLP.
Abstract
Compositional Natural Language Inference has been explored to assess the true abilities of neural models to perform NLI. Yet, current evaluations assume models to have full access to all primitive inferences in advance, in contrast to humans that continuously acquire inference knowledge. In this paper, we introduce the Continual Compositional Generalization in Inference (C2Gen NLI) challenge, where a model continuously acquires knowledge of constituting primitive inference tasks as a basis for compositional inferences. We explore how continual learning affects compositional generalization in NLI, by designing a continual learning setup for compositional NLI inference tasks. Our experiments demonstrate that models fail to compositionally generalize in a continual scenario. To address this problem, we first benchmark various continual learning algorithms and verify their efficacy. We then further analyze C2Gen, focusing on how to order primitives and compositional inference types and examining correlations between subtasks. Our analyses show that by learning subtasks continuously while observing their dependencies and increasing degrees of difficulty, continual learning can enhance composition generalization ability.
