Interpretable Image Classification via Non-parametric Part Prototype Learning
Zhijie Zhu, Lei Fan, Maurice Pagnucco, Yang Song
TL;DR
This work tackles the limited interpretability of ProtoPNets due to repetitive part explanations by introducing non-parametric part-prototypes learned per class via clustering of backbone features, validated on fine-grained datasets. It employs a two-stage training with foundation Vision Transformers (e.g., $\text{ViT}$ backbones pre-trained with self-distillation) and a prototype-anchored fine-tuning strategy, including a Patch-Prototype Distance Contrastive loss and Block Expansion for efficient feature space grounding. Key contributions include a robust, diverse set of part-prototypes, an optimal-transport-based assignment with entropic regularization, and two new metrics—Distinctiveness and Comprehensiveness—to quantify explanation diversity and foreground coverage. Empirically, the method achieves competitive classification accuracy while delivering richer, more holistic explanations, demonstrating practical impact for trustworthy, interpretable image classification and adaptable concept-based explanations.
Abstract
Classifying images with an interpretable decision-making process is a long-standing problem in computer vision. In recent years, Prototypical Part Networks has gained traction as an approach for self-explainable neural networks, due to their ability to mimic human visual reasoning by providing explanations based on prototypical object parts. However, the quality of the explanations generated by these methods leaves room for improvement, as the prototypes usually focus on repetitive and redundant concepts. Leveraging recent advances in prototype learning, we present a framework for part-based interpretable image classification that learns a set of semantically distinctive object parts for each class, and provides diverse and comprehensive explanations. The core of our method is to learn the part-prototypes in a non-parametric fashion, through clustering deep features extracted from foundation vision models that encode robust semantic information. To quantitatively evaluate the quality of explanations provided by ProtoPNets, we introduce Distinctiveness Score and Comprehensiveness Score. Through evaluation on CUB-200-2011, Stanford Cars and Stanford Dogs datasets, we show that our framework compares favourably against existing ProtoPNets while achieving better interpretability. Code is available at: https://github.com/zijizhu/proto-non-param.
