Representational Similarity via Interpretable Visual Concepts
Neehar Kondapaneni, Oisin Mac Aodha, Pietro Perona
TL;DR
This work tackles the challenge of not only measuring how similarly two networks represent information but also exposing what visual concepts underlie their similarities and differences. It introduces Representational Similarity via Interpretable Visual Concepts (RSVC), which decomposes layer activations into concept dictionaries and coefficients, then assesses cross-model concept alignment via regression-based mappings and correlation metrics. The approach enables interpretation of both shared and unique concepts, supports a replacement test to connect representational changes to model decisions, and provides visualizations for low-similarity concepts. Extensive experiments across CNNs and ViTs, with varying training protocols and even an LLVM-assisted qualitative analysis, demonstrate RSVC's generality, its ability to reveal meaningful differences, and its potential to inform debugging and fairness considerations in vision models.
Abstract
How do two deep neural networks differ in how they arrive at a decision? Measuring the similarity of deep networks has been a long-standing open question. Most existing methods provide a single number to measure the similarity of two networks at a given layer, but give no insight into what makes them similar or dissimilar. We introduce an interpretable representational similarity method (RSVC) to compare two networks. We use RSVC to discover shared and unique visual concepts between two models. We show that some aspects of model differences can be attributed to unique concepts discovered by one model that are not well represented in the other. Finally, we conduct extensive evaluation across different vision model architectures and training protocols to demonstrate its effectiveness.
