Rashomon Sets for Prototypical-Part Networks: Editing Interpretable Models in Real-Time
Jon Donnelly, Zhicheng Guo, Alina Jade Barnett, Hayden McTavish, Chaofan Chen, Cynthia Rudin
TL;DR
Proto-RSet introduces a practical framework to edit interpretable ProtoPNets in real time by precomputing a Rashomon set of near-optimal models. It fixes the backbone and prototypes and locally approximates the last-layer space as an ellipsoid around the optimal weights, enabling fast sampling and constraint enforcement, including prototype removal or requirement. Across multiple datasets and backbones, Proto-RSet yields models that meet user constraints while preserving or improving accuracy, and user studies show clinicians and non-experts can rapidly refine models with guarantees. This approach shifts model debugging from slow retraining to interactive, explainable model-space exploration with real-world impact in high-stakes domains such as skin cancer classification and bias removal in bird identification.
Abstract
Interpretability is critical for machine learning models in high-stakes settings because it allows users to verify the model's reasoning. In computer vision, prototypical part models (ProtoPNets) have become the dominant model type to meet this need. Users can easily identify flaws in ProtoPNets, but fixing problems in a ProtoPNet requires slow, difficult retraining that is not guaranteed to resolve the issue. This problem is called the "interaction bottleneck." We solve the interaction bottleneck for ProtoPNets by simultaneously finding many equally good ProtoPNets (i.e., a draw from a "Rashomon set"). We show that our framework - called Proto-RSet - quickly produces many accurate, diverse ProtoPNets, allowing users to correct problems in real time while maintaining performance guarantees with respect to the training set. We demonstrate the utility of this method in two settings: 1) removing synthetic bias introduced to a bird identification model and 2) debugging a skin cancer identification model. This tool empowers non-machine-learning experts, such as clinicians or domain experts, to quickly refine and correct machine learning models without repeated retraining by machine learning experts.
