Meta Learning not to Learn: Robustly Informing Meta-Learning under Nuisance-Varying Families
Louis McConnell
TL;DR
This work addresses out-of-distribution generalization under nuisance-varying task families by balancing positive and negative inductive biases in meta-learning. It introduces Robustly Informed Meta Learning (RIME), a causal framework that uses inverse probability weighting and a mutual-information penalty to isolate nuisance factors $z$ from predictive signals, and employs an Informed Neural Process to model task-conditioned predictions with a learned latent representation $C$. The paper formalizes the objective $R(\\hat{p})=\\sup_{e \\in \\mathcal{E}} - \\mathbb{E}_{p(C)} \\mathbb{E}_{p_e(x | C)} D_{\\mathrm{KL}}(p_e(y | x, C) \\| \\hat{p}(y | x, C))$ and the RIME loss $\\mathcal{L}_{RIME}= \\mathcal{L}_1 + \\beta \\mathcal{L}_2 + \\lambda \\mathcal{L}_3$, where $\\mathcal{L}_3= \\mathbb{I}_{p_{\\perp\\perp}}[(C, r_{\\gamma}(x), y); z]$ controls residual information about $z$. The approach yields state-of-the-art performance on distributionally robust objectives in nuisance-varying settings, demonstrated across no-task-variability and task-variability experiments, with ablations showing the importance of informed critics and accurate mutual-information enforcement. This work provides both a theoretical and empirical framework for robust meta-learning under environment and task heterogeneity, with potential impact on medical imaging and other domains facing site- and environment-specific shifts.
Abstract
In settings where both spurious and causal predictors are available, standard neural networks trained under the objective of empirical risk minimization (ERM) with no additional inductive biases tend to have a dependence on a spurious feature. As a result, it is necessary to integrate additional inductive biases in order to guide the network toward generalizable hypotheses. Often these spurious features are shared across related tasks, such as estimating disease prognoses from image scans coming from different hospitals, making the challenge of generalization more difficult. In these settings, it is important that methods are able to integrate the proper inductive biases to generalize across both nuisance-varying families as well as task families. Motivated by this setting, we present RIME (Robustly Informed Meta lEarning), a new method for meta learning under the presence of both positive and negative inductive biases (what to learn and what not to learn). We first develop a theoretical causal framework showing why existing approaches at knowledge integration can lead to worse performance on distributionally robust objectives. We then show that RIME is able to simultaneously integrate both biases, reaching state of the art performance under distributionally robust objectives in informed meta-learning settings under nuisance-varying families.
