Task-Specific Preconditioner for Cross-Domain Few-Shot Learning
Suhyun Kang, Jungwon Park, Wonseok Lee, Wonjong Rhee
TL;DR
This work tackles cross-domain few-shot learning by introducing Task-Specific Preconditioned gradient descent (TSP), a mechanism that adapts optimization to the target task via a positive-definite preconditioner. TSP meta-learns Domain-Specific Preconditioners (DSPs) for each seen domain and combines them with task-coefficients derived from a Dataset Classifier to form a Task-Specific Preconditioner that guides gradient descent toward the steepest-descent direction in the target task’s geometry. Key contributions include a formal PD design for DSPs, a bi-level optimization framework to learn both DSPs and task-coefficients, and extensive experiments on Meta-Dataset showing state-of-the-art results in both multi-domain and single-domain settings, with ablations highlighting the PD constraint’s importance. The approach enables robust cross-domain adaptation by leveraging multi-domain knowledge to tailor the optimization process, yielding practical improvements for cross-domain few-shot classification tasks.
Abstract
Cross-Domain Few-Shot Learning~(CDFSL) methods typically parameterize models with task-agnostic and task-specific parameters. To adapt task-specific parameters, recent approaches have utilized fixed optimization strategies, despite their potential sub-optimality across varying domains or target tasks. To address this issue, we propose a novel adaptation mechanism called Task-Specific Preconditioned gradient descent~(TSP). Our method first meta-learns Domain-Specific Preconditioners~(DSPs) that capture the characteristics of each meta-training domain, which are then linearly combined using task-coefficients to form the Task-Specific Preconditioner. The preconditioner is applied to gradient descent, making the optimization adaptive to the target task. We constrain our preconditioners to be positive definite, guiding the preconditioned gradient toward the direction of steepest descent. Empirical evaluations on the Meta-Dataset show that TSP achieves state-of-the-art performance across diverse experimental scenarios.
