Can this Model Also Recognize Dogs? Zero-Shot Model Search from Weights

Jonathan Kahana; Or Nathan; Eliahu Horwitz; Yedid Hoshen

Can this Model Also Recognize Dogs? Zero-Shot Model Search from Weights

Jonathan Kahana, Or Nathan, Eliahu Horwitz, Yedid Hoshen

TL;DR

The paper tackles the problem of finding classifiers that recognize a target concept in large public model repositories without access to training data or metadata. It introduces ProbeLog, a logit-level descriptor derived by probing fixed inputs and normalizing logit responses, and extends it to zero-shot text-based search via CLIP-like alignment. A discrepancy metric tailored for logit descriptors and a collaborative probing strategy using matrix factorization enable scalable gallery encoding with reduced computational cost. Empirical results on INet-Hub and HF-Hub show high retrieval accuracy for both in-distribution and cross-domain queries, with substantial gains over baselines and practical reductions in probe requirements. This approach offers a principled, scalable solution for practical model search that can reduce training time, cost, and environmental impact while improving access to task-specific pretrained classifiers.

Abstract

With the increasing numbers of publicly available models, there are probably pretrained, online models for most tasks users require. However, current model search methods are rudimentary, essentially a text-based search in the documentation, thus users cannot find the relevant models. This paper presents ProbeLog, a method for retrieving classification models that can recognize a target concept, such as "Dog", without access to model metadata or training data. Differently from previous probing methods, ProbeLog computes a descriptor for each output dimension (logit) of each model, by observing its responses on a fixed set of inputs (probes). Our method supports both logit-based retrieval ("find more logits like this") and zero-shot, text-based retrieval ("find all logits corresponding to dogs"). As probing-based representations require multiple costly feedforward passes through the model, we develop a method, based on collaborative filtering, that reduces the cost of encoding repositories by 3x. We demonstrate that ProbeLog achieves high retrieval accuracy, both in real-world and fine-grained search tasks and is scalable to full-size repositories.

Can this Model Also Recognize Dogs? Zero-Shot Model Search from Weights

TL;DR

Abstract

Can this Model Also Recognize Dogs? Zero-Shot Model Search from Weights

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (8)