On The Specialization of Neural Modules
Devon Jarvis, Richard Klein, Benjamin Rosman, Andrew M. Saxe
TL;DR
This work formalizes systematic generalization by separating compositional structure from world-wide structure and introducing a tractable dataset space with input/output blocks $X=[\Omega_x \ \Gamma_x]^T$, $Y=[\Omega_y \ \Gamma_y]^T$. It analyzes training dynamics of deep and shallow linear networks via SVD on covariances $\Sigma^x$ and $\Sigma^{yx}$, showing that learning proceeds along three effective singular-value modes and that dense networks inherently couple compositional and non-compositional sub-structures, hindering systematicity. The authors demonstrate that modular architectures can achieve fully systematic mappings only when the lower-rank, compositional sub-structure is perfectly segregated, and they validate these insights in CMNIST, where a split, modular CNN preserves compositional generalization while a dense network fails. Collectively, the paper clarifies how dataset structure and architectural biases interact to enable or obstruct systematic module specialization, informing design principles for robust, modular generalization in neural networks.
Abstract
A number of machine learning models have been proposed with the goal of achieving systematic generalization: the ability to reason about new situations by combining aspects of previous experiences. These models leverage compositional architectures which aim to learn specialized modules dedicated to structures in a task that can be composed to solve novel problems with similar structures. While the compositionality of these architectures is guaranteed by design, the modules specializing is not. Here we theoretically study the ability of network modules to specialize to useful structures in a dataset and achieve systematic generalization. To this end we introduce a minimal space of datasets motivated by practical systematic generalization benchmarks. From this space of datasets we present a mathematical definition of systematicity and study the learning dynamics of linear neural modules when solving components of the task. Our results shed light on the difficulty of module specialization, what is required for modules to successfully specialize, and the necessity of modular architectures to achieve systematicity. Finally, we confirm that the theoretical results in our tractable setting generalize to more complex datasets and non-linear architectures.
