MCPNet: An Interpretable Classifier via Multi-Level Concept Prototypes
Bor-Shiun Wang, Chien-Yi Wang, Wei-Chen Chiu
TL;DR
MCPNet addresses the need for faithful, multi-scale model explanations by learning multi-level concept prototypes across backbone layers using a Centered Kernel Alignment loss and an energy-weighted PCA. Classification is performed via MCP distributions, aligning image concepts with class-specific centroids through a Class-aware Concept Distribution loss, without relying on a final fully connected layer. The approach yields richer, multi-level explanations—from low-level color cues to high-level object concepts—while maintaining competitive accuracy and demonstrating strong generalization in few-shot scenarios. This framework enhances interpretability and trust in deep models with practical implications for transparent AI in visual classification tasks.
Abstract
Recent advancements in post-hoc and inherently interpretable methods have markedly enhanced the explanations of black box classifier models. These methods operate either through post-analysis or by integrating concept learning during model training. Although being effective in bridging the semantic gap between a model's latent space and human interpretation, these explanation methods only partially reveal the model's decision-making process. The outcome is typically limited to high-level semantics derived from the last feature map. We argue that the explanations lacking insights into the decision processes at low and mid-level features are neither fully faithful nor useful. Addressing this gap, we introduce the Multi-Level Concept Prototypes Classifier (MCPNet), an inherently interpretable model. MCPNet autonomously learns meaningful concept prototypes across multiple feature map levels using Centered Kernel Alignment (CKA) loss and an energy-based weighted PCA mechanism, and it does so without reliance on predefined concept labels. Further, we propose a novel classifier paradigm that learns and aligns multi-level concept prototype distributions for classification purposes via Class-aware Concept Distribution (CCD) loss. Our experiments reveal that our proposed MCPNet while being adaptable to various model architectures, offers comprehensive multi-level explanations while maintaining classification accuracy. Additionally, its concept distribution-based classification approach shows improved generalization capabilities in few-shot classification scenarios.
