Attention-based Iterative Decomposition for Tensor Product Representation
Taewon Park, Inchul Choi, Minho Lee
TL;DR
This work addresses the challenge of incomplete decomposition in Tensor Product Representations (TPR) that hinders systematic generalization. It introduces Attention-based Iterative Decomposition (AID), a slot-based competitive-attention module that iteratively binds input features to TPR roles and fillers, with a context-dependent routing mechanism to map components to symbols. By integrating AID into TPR-RNN, Fast Weight Memory, and Linear Transformer architectures, the paper demonstrates consistent improvements across synthetic SAR, systematic bAbI, Sort-of-CLEVR, and WikiText-103 tasks, supported by disentanglement and orthogonality analyses. The findings indicate that better decomposition yields more compositional, well-bound representations and enhances generalization to unseen compositions, with practical implications for scalable, real-world applications of TPR-based models.
Abstract
In recent research, Tensor Product Representation (TPR) is applied for the systematic generalization task of deep neural networks by learning the compositional structure of data. However, such prior works show limited performance in discovering and representing the symbolic structure from unseen test data because their decomposition to the structural representations was incomplete. In this work, we propose an Attention-based Iterative Decomposition (AID) module designed to enhance the decomposition operations for the structured representations encoded from the sequential input data with TPR. Our AID can be easily adapted to any TPR-based model and provides enhanced systematic decomposition through a competitive attention mechanism between input features and structured representations. In our experiments, AID shows effectiveness by significantly improving the performance of TPR-based prior works on the series of systematic generalization tasks. Moreover, in the quantitative and qualitative evaluations, AID produces more compositional and well-bound structural representations than other works.
