Compositional meta-learning through probabilistic task inference
Jacob J. W. Bakermans, Pablo Tano, Reidar Riveland, Charles Findling, Alexandre Pouget
TL;DR
This work addresses meta-learning by enabling rapid adaptation to new tasks through compositional inference over learned task structure. It introduces a probabilistic generative model that separates within-module dynamics from between-module dynamics using a gating network $G_{m{ heta}}$ and modular RNNs ${M_{m{}}^z}$, trained by maximizing the marginal likelihood $L = p(m{y}_{1:T};oldsymbol{})$. For new tasks, solutions are obtained via inference (e.g., particle filtering) to identify the best module sequence ${z}_{1:T}$ without updating parameters, achieving one-shot acquisition in abstract rule learning and motor skills, even under sparse feedback. The framework bridges neural expressivity with probabilistic data efficiency, with potential extensions to continual learning, dynamic module growth, and more expressive task grammars.
Abstract
To solve a new task from minimal experience, it is essential to effectively reuse knowledge from previous tasks, a problem known as meta-learning. Compositional solutions, where common elements of computation are flexibly recombined into new configurations, are particularly well-suited for meta-learning. Here, we propose a compositional meta-learning model that explicitly represents tasks as structured combinations of reusable computations. We achieve this by learning a generative model that captures the underlying components and their statistics shared across a family of tasks. This approach transforms learning a new task into a probabilistic inference problem, which allows for finding solutions without parameter updates through highly constrained hypothesis testing. Our model successfully recovers ground truth components and statistics in rule learning and motor learning tasks. We then demonstrate its ability to quickly infer new solutions from just single examples. Together, our framework joins the expressivity of neural networks with the data-efficiency of probabilistic inference to achieve rapid compositional meta-learning.
