Robust Fast Adaptation from Adversarially Explicit Task Distribution Generation
Cheems Wang, Yiqin Lv, Yixiu Mao, Yun Qu, Yi Xu, Xiangyang Ji
TL;DR
The paper tackles robustness gaps in meta-learning caused by distribution shifts by explicitly modeling task distributions over task identifiers and optimizing robust fast adaptation as a Stackelberg game. Task distributions are generated adversarially via normalizing flows, with a KL-based constraint ensuring shifts remain within a tolerable region, and the meta-learner acts as the leader while the distribution adversary acts as the follower. The authors establish convergence to a local Stackelberg equilibrium and derive a generalization bound under distribution shifts, while demonstrating improved robustness (via CVaR) on sinusoid, Acrobot, Pendulum, and meta-reinforcement learning benchmarks. The approach also reveals interpretable task-structure patterns in the learned distributions and provides code for replication, representing a practical advance for robust fast adaptation in real-world, shift-prone settings.
Abstract
Meta-learning is a practical learning paradigm to transfer skills across tasks from a few examples. Nevertheless, the existence of task distribution shifts tends to weaken meta-learners' generalization capability, particularly when the training task distribution is naively hand-crafted or based on simple priors that fail to cover critical scenarios sufficiently. Here, we consider explicitly generative modeling task distributions placed over task identifiers and propose robustifying fast adaptation from adversarial training. Our approach, which can be interpreted as a model of a Stackelberg game, not only uncovers the task structure during problem-solving from an explicit generative model but also theoretically increases the adaptation robustness in worst cases. This work has practical implications, particularly in dealing with task distribution shifts in meta-learning, and contributes to theoretical insights in the field. Our method demonstrates its robustness in the presence of task subpopulation shifts and improved performance over SOTA baselines in extensive experiments. The code is available at the project site https://sites.google.com/view/ar-metalearn.
