Feature Space Perturbation: A Panacea to Enhanced Transferability Estimation
Prafful Kumar Khoba, Zijian Wang, Chetan Arora, Mahsa Baktashmotlagh
TL;DR
This work targets robust model ranking for transfer learning by introducing feature space perturbation (Spread and Attract, SA) to test embedding resilience and improve transferability estimates. SA perturbs intra-class structure via Spread and blurs inter-class boundaries via Attract, producing perturbations $\mathbf{\hat{X}}_{\text{attract}}$ that are fed to a transferability metric $\mathcal{M}$ to yield scores $T_l$ used for ranking, with PCA used to manage dimensionality. Empirically, SA yields substantial gains across vanilla, last-block fine-tuning (LBFT), and linear fine-tuning (LFT), e.g., up to $28.84\%$ improvement on LogMe and notable gains across multiple metrics; for self-supervised models, an LDA-based metric outperforms SOTA by $12.7\%$ (vanilla FT) and $15.06\%$ (LFT). The results highlight robustness as a critical dimension in transferability estimation and suggest promising directions for adaptive metrics that jointly optimize adaptability and robustness across supervised and self-supervised paradigms.
Abstract
Leveraging a transferability estimation metric facilitates the non-trivial challenge of selecting the optimal model for the downstream task from a pool of pre-trained models. Most existing metrics primarily focus on identifying the statistical relationship between feature embeddings and the corresponding labels within the target dataset, but overlook crucial aspect of model robustness. This oversight may limit their effectiveness in accurately ranking pre-trained models. To address this limitation, we introduce a feature perturbation method that enhances the transferability estimation process by systematically altering the feature space. Our method includes a Spread operation that increases intra-class variability, adding complexity within classes, and an Attract operation that minimizes the distances between different classes, thereby blurring the class boundaries. Through extensive experimentation, we demonstrate the efficacy of our feature perturbation method in providing a more precise and robust estimation of model transferability. Notably, the existing LogMe method exhibited a significant improvement, showing a 28.84% increase in performance after applying our feature perturbation method.
