Compositional generalization through meta sequence-to-sequence learning
Brenden M. Lake
TL;DR
Standard seq2seq models struggle with compositional generalization. The paper introduces meta seq2seq learning, a memory-augmented, episodic meta-training framework that enables learning of rule-like generalization over sequences. Across several SCAN-based experiments, the approach achieves strong performance on many compositional tasks, but fails to systematically generalize to much longer output sequences, highlighting both progress and remaining gaps. These findings suggest a promising direction for combining memory and meta-learning, with potential extensions toward neuro-symbolic hybrids to handle longer and more abstract generalizations.
Abstract
People can learn a new concept and use it compositionally, understanding how to "blicket twice" after learning how to "blicket." In contrast, powerful sequence-to-sequence (seq2seq) neural networks fail such tests of compositionality, especially when composing new concepts together with existing concepts. In this paper, I show how memory-augmented neural networks can be trained to generalize compositionally through meta seq2seq learning. In this approach, models train on a series of seq2seq problems to acquire the compositional skills needed to solve new seq2seq problems. Meta se2seq learning solves several of the SCAN tests for compositional learning and can learn to apply implicit rules to variables.
