An Overview of Prototype Formulations for Interpretable Deep Learning
Maximilian Xiling Li, Korbinian Franz Rudolf, Paul Mattes, Nils Blank, Rudolf Lioutikov
TL;DR
The paper addresses the interpretability gap in deep vision models by systematically evaluating prototype-based approaches. It introduces HyperPG, a probabilistic prototype representation on the hypersphere that models a Gaussian over cosine similarities, and benchmarks it against Euclidean and cosine prototypes across multiple datasets. Results show hyperspherical, probabilistic formulations offer competitive or superior performance with markedly reduced sensitivity to training hyperparameters, especially under simplified optimization regimes, and provide robust interpretability through prototype activations. The findings suggest that hyperspherical and probabilistic prototypes enhance practical deployment of interpretable deep learning, with avenues for extending to mixture models and Bayesian variants.
Abstract
Prototypical part networks offer interpretable alternatives to black-box deep learning models by learning visual prototypes for classification. This work provides a comprehensive analysis of prototype formulations, comparing point-based and probabilistic approaches in both Euclidean and hyperspherical latent spaces. We introduce HyperPG, a probabilistic prototype representation using Gaussian distributions on hyperspheres. Experiments on CUB-200-2011, Stanford Cars, and Oxford Flowers datasets show that hyperspherical prototypes outperform standard Euclidean formulations. Critically, hyperspherical prototypes maintain competitive performance under simplified training schemes, while Euclidean prototypes require extensive hyperparameter tuning.
