Adaptive Gradient-Based Meta-Learning Methods

Mikhail Khodak; Maria-Florina Balcan; Ameet Talwalkar

Adaptive Gradient-Based Meta-Learning Methods

Mikhail Khodak, Maria-Florina Balcan, Ameet Talwalkar

TL;DR

ARUBA reframes gradient-based meta-learning as online learning of regret upper-bounds, enabling adaptive learning of task similarity and dynamic environment handling through online mirror descent with Bregman divergences. It provides theoretical guarantees for static, dynamic, and statistical learning-to-learn settings, and introduces practical per-coordinate learning-rate mechanisms (ARUBA and variants) that automatically adapt to task structure and geometry. Empirically, ARUBA improves meta-test-time performance on few-shot classification and federated learning benchmarks, while reducing the need for hyperparameter tuning. Overall, ARUBA offers a principled, tunable framework that extends GBML to adaptive, geometry-aware, and communication-efficient settings with strong transfer guarantees.

Abstract

We build a theoretical framework for designing and understanding practical meta-learning methods that integrates sophisticated formalizations of task-similarity with the extensive literature on online convex optimization and sequential prediction algorithms. Our approach enables the task-similarity to be learned adaptively, provides sharper transfer-risk bounds in the setting of statistical learning-to-learn, and leads to straightforward derivations of average-case regret bounds for efficient algorithms in settings where the task-environment changes dynamically or the tasks share a certain geometric structure. We use our theory to modify several popular meta-learning algorithms and improve their meta-test-time performance on standard problems in few-shot learning and federated learning.

Adaptive Gradient-Based Meta-Learning Methods

TL;DR

Abstract

Adaptive Gradient-Based Meta-Learning Methods

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (7)

Theorems & Definitions (98)