MAC: A Meta-Learning Approach for Feature Learning and Recombination
S. Tiwari, M. Gogoi, S. Verma, K. P. Singh
TL;DR
MAC extends optimization-based meta-learning by introducing Additional Connection Units (ACUs) to widen the network, enabling learning of new task-specific features during meta-testing while freezing base meta-trained parameters to preserve prior knowledge. By combining ACU-driven feature learning with existing meta-learned representations, MAC achieves improved adaptation under distribution shifts, outperforming ANIL on perturbed distributions in few-shot tasks (notably around $\approx 13\%$ in 5-shot scenarios). The approach emphasizes width over depth for accommodating new features and provides empirical guidance on ACU quantities, placement, and depth, along with an analysis of computational implications. Overall, MAC demonstrates that controlled feature learning and recombination are crucial for robust meta-testing in non-identical task distributions, offering a practical route for few-shot learning under real-world distribution shifts.
Abstract
Optimization-based meta-learning aims to learn an initialization so that a new unseen task can be learned within a few gradient updates. Model Agnostic Meta-Learning (MAML) is a benchmark algorithm comprising two optimization loops. The inner loop is dedicated to learning a new task and the outer loop leads to meta-initialization. However, ANIL (almost no inner loop) algorithm shows that feature reuse is an alternative to rapid learning in MAML. Thus, the meta-initialization phase makes MAML primed for feature reuse and obviates the need for rapid learning. Contrary to ANIL, we hypothesize that there may be a need to learn new features during meta-testing. A new unseen task from non-similar distribution would necessitate rapid learning in addition reuse and recombination of existing features. In this paper, we invoke the width-depth duality of neural networks, wherein, we increase the width of the network by adding extra computational units (ACU). The ACUs enable the learning of new atomic features in the meta-testing task, and the associated increased width facilitates information propagation in the forwarding pass. The newly learnt features combine with existing features in the last layer for meta-learning. Experimental results show that our proposed MAC method outperformed existing ANIL algorithm for non-similar task distribution by approximately 13% (5-shot task setting)
