A linearized framework and a new benchmark for model selection for fine-tuning
Aditya Deshpande, Alessandro Achille, Avinash Ravichandran, Hao Li, Luca Zancato, Charless Fowlkes, Rahul Bhotika, Stefano Soatto, Pietro Perona
TL;DR
The paper addresses pre-selecting the best pre-trained model to fine-tune from a diverse zoo without training, focusing on low-data transfer. It introduces a linearized approximation to fine-tuning inspired by the Neural Tangent Kernel and derives two baselines, Label-Gradient Correlation (LGC) and Label-Feature Correlation (LFC), from the interaction between target labels and pre-trained gradients or features. A large-scale benchmark with 30 single-domain and a multi-domain expert trained on 8 source datasets across many target tasks demonstrates that model zoos can outperform Imagenet-based fine-tuning, especially in data-scarce regimes. The results show LGC and particularly LFC correlate strongly with actual fine-tuning performance, enabling fast, few-shot model selection that reduces brute-force searches and offers practical gains for domain transfer. Overall, the work provides a principled, scalable approach to model reuse and a benchmark to advance model-selection research.
Abstract
Fine-tuning from a collection of models pre-trained on different domains (a "model zoo") is emerging as a technique to improve test accuracy in the low-data regime. However, model selection, i.e. how to pre-select the right model to fine-tune from a model zoo without performing any training, remains an open topic. We use a linearized framework to approximate fine-tuning, and introduce two new baselines for model selection -- Label-Gradient and Label-Feature Correlation. Since all model selection algorithms in the literature have been tested on different use-cases and never compared directly, we introduce a new comprehensive benchmark for model selection comprising of: i) A model zoo of single and multi-domain models, and ii) Many target tasks. Our benchmark highlights accuracy gain with model zoo compared to fine-tuning Imagenet models. We show our model selection baseline can select optimal models to fine-tune in few selections and has the highest ranking correlation to fine-tuning accuracy compared to existing algorithms.
