A Robust Prototype-Based Network with Interpretable RBF Classifier Foundations
Sascha Saralajew, Ashish Rana, Thomas Villmann, Ammar Shaker
TL;DR
This work addresses the interpretability gap in prototype-based classification by analyzing deep prototype-based networks (PBNs) and their relation to RBF classifiers, highlighting interpretability and robustness limitations. It introduces a robust Classification-by-Components (CBC) extension that removes indefinite reasoning in favor of class-specific priors and probabilistic negative reasoning, accompanied by a proven robustness bound and a loss that optimizes robustness. The authors show that deep PBNs are closely tied to deep RBF heads, but suffer interpretability issues, while shallow CBC offers inherent interpretability and provable robustness; empirically, the deep CBC achieves state-of-the-art accuracy on benchmarks and the shallow CBC demonstrates robustness advantages. The work provides a practical pathway to interpretable, robust prototype-based classification with strong empirical performance and theoretical guarantees, while outlining future work on extending to non-image domains and refining training stability.
Abstract
Prototype-based classification learning methods are known to be inherently interpretable. However, this paradigm suffers from major limitations compared to deep models, such as lower performance. This led to the development of the so-called deep Prototype-Based Networks (PBNs), also known as prototypical parts models. In this work, we analyze these models with respect to different properties, including interpretability. In particular, we focus on the Classification-by-Components (CBC) approach, which uses a probabilistic model to ensure interpretability and can be used as a shallow or deep architecture. We show that this model has several shortcomings, like creating contradicting explanations. Based on these findings, we propose an extension of CBC that solves these issues. Moreover, we prove that this extension has robustness guarantees and derive a loss that optimizes robustness. Additionally, our analysis shows that most (deep) PBNs are related to (deep) RBF classifiers, which implies that our robustness guarantees generalize to shallow RBF classifiers. The empirical evaluation demonstrates that our deep PBN yields state-of-the-art classification accuracy on different benchmarks while resolving the interpretability shortcomings of other approaches. Further, our shallow PBN variant outperforms other shallow PBNs while being inherently interpretable and exhibiting provable robustness guarantees.
