More Flexible PAC-Bayesian Meta-Learning by Learning Learning Algorithms
Hossein Zakerinia, Amin Behjati, Christoph H. Lampert
TL;DR
The paper extends PAC-Bayesian theory to meta-learning by modeling knowledge transfer as learning learning algorithms, not just priors. It introduces two generalization bounds that apply to a broad set of learning algorithms, including hypernetworks, representations, and optimization-based methods, by employing a meta-posterior over algorithms and hyper-posteriors over priors. The bounds separate task-level generalization from within-task generalization and accommodate algorithm-specific hyper-priors, enabling flexible, environment-adaptive transfers. Empirical studies show the new bounds yield tighter estimates and that decoupling initialization from regularization can improve performance on low-data meta-learning tasks. Overall, the framework promises broader applicability and potential improvements for practical meta-learning across diverse mechanisms.
Abstract
We introduce a new framework for studying meta-learning methods using PAC-Bayesian theory. Its main advantage over previous work is that it allows for more flexibility in how the transfer of knowledge between tasks is realized. For previous approaches, this could only happen indirectly, by means of learning prior distributions over models. In contrast, the new generalization bounds that we prove express the process of meta-learning much more directly as learning the learning algorithm that should be used for future tasks. The flexibility of our framework makes it suitable to analyze a wide range of meta-learning mechanisms and even design new mechanisms. Other than our theoretical contributions we also show empirically that our framework improves the prediction quality in practical meta-learning mechanisms.
