Fast Rate Bounds for Multi-Task and Meta-Learning with Different Sample Sizes

Hossein Zakerinia; Christoph H. Lampert

Fast Rate Bounds for Multi-Task and Meta-Learning with Different Sample Sizes

Hossein Zakerinia, Christoph H. Lampert

TL;DR

This work addresses the gap in fast-rate PAC-Bayesian generalization bounds for unbalanced multi-task learning and meta-learning, where tasks have varying numbers of training samples. It develops fast-rate bounds of KL-style and Catoni-style across two risk definitions—task-centric and sample-centric—and extends these results to meta-learning via a hyper-posterior framework. The authors show that unbalanced settings have distinct statistical properties from balanced ones and provide numerically computable bounds, including practical procedures to combine bounds for tighter guarantees. Empirical results on linear models and neural networks demonstrate that the new fast-rate bounds can offer tighter, non-vacuous guarantees than standard-rate bounds, especially when training errors are small.

Abstract

We present new fast-rate PAC-Bayesian generalization bounds for multi-task and meta-learning in the unbalanced setting, i.e. when the tasks have training sets of different sizes, as is typically the case in real-world scenarios. Previously, only standard-rate bounds were known for this situation, while fast-rate bounds were limited to the setting where all training sets are of equal size. Our new bounds are numerically computable as well as interpretable, and we demonstrate their flexibility in handling a number of cases where they give stronger guarantees than previous bounds. Besides the bounds themselves, we also make conceptual contributions: we demonstrate that the unbalanced multi-task setting has different statistical properties than the balanced situation, specifically that proofs from the balanced situation do not carry over to the unbalanced setting. Additionally, we shed light on the fact that the unbalanced situation allows two meaningful definitions of multi-task risk, depending on whether all tasks should be considered equally important or if sample-rich tasks should receive more weight than sample-poor ones.

Fast Rate Bounds for Multi-Task and Meta-Learning with Different Sample Sizes

TL;DR

Abstract

Fast Rate Bounds for Multi-Task and Meta-Learning with Different Sample Sizes

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (3)

Theorems & Definitions (32)