Can this Model Also Recognize Dogs? Zero-Shot Model Search from Weights

📅 2025-02-13
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing model repositories lack effective retrieval mechanisms, relying solely on text-based search, which impedes concept-level matching (e.g., “dog”) due to semantic misalignment between natural language queries and model internals. Method: This paper proposes a zero-shot classification model retrieval method that requires neither model metadata nor training data. It extracts semantic descriptors from the logit layer of each model using fixed probe inputs, enabling direct alignment of human-understandable concepts to logit-space representations and similarity measurement therein. Contribution/Results: We introduce a novel logit-level probing representation paradigm and a collaborative filtering acceleration strategy, reducing large-scale model encoding overhead by 3×. Our approach achieves high retrieval accuracy on both real-world model repositories and fine-grained downstream tasks, and scales effectively to comprehensive public model warehouses.

Technology Category

Application Category

📝 Abstract
With the increasing numbers of publicly available models, there are probably pretrained, online models for most tasks users require. However, current model search methods are rudimentary, essentially a text-based search in the documentation, thus users cannot find the relevant models. This paper presents ProbeLog, a method for retrieving classification models that can recognize a target concept, such as"Dog", without access to model metadata or training data. Differently from previous probing methods, ProbeLog computes a descriptor for each output dimension (logit) of each model, by observing its responses on a fixed set of inputs (probes). Our method supports both logit-based retrieval ("find more logits like this") and zero-shot, text-based retrieval ("find all logits corresponding to dogs"). As probing-based representations require multiple costly feedforward passes through the model, we develop a method, based on collaborative filtering, that reduces the cost of encoding repositories by 3x. We demonstrate that ProbeLog achieves high retrieval accuracy, both in real-world and fine-grained search tasks and is scalable to full-size repositories.
Problem

Research questions and friction points this paper is trying to address.

Retrieve classification models without metadata
Develop cost-effective model encoding method
Achieve high accuracy in model retrieval
Innovation

Methods, ideas, or system contributions that make the work stand out.

ProbeLog retrieves models without metadata
Uses logit-based and zero-shot text retrieval
Reduces encoding costs via collaborative filtering
J
Jonathan Kahana
School of Computer Science and Engineering, The Hebrew University of Jerusalem, Israel
O
Or Nathan
School of Computer Science and Engineering, The Hebrew University of Jerusalem, Israel
E
Eliahu Horwitz
School of Computer Science and Engineering, The Hebrew University of Jerusalem, Israel
Yedid Hoshen
Yedid Hoshen
The Hebrew University of Jerusalem
Deep LearningAIComputer Vision