SSplain: Sparse and Smooth Explainer for Retinopathy of Prematurity Classification
Elifnur Sunger, Tales Imbiriba, Peter Campbell, Deniz Erdogmus, Stratis Ioannidis, Jennifer Dy
TL;DR
SSplain addresses the need for interpretable explanations in Retinopathy of Prematurity classification by learning pixelwise masks that are simultaneously sparse and smooth, preserving the intrinsic vessel structure. It casts explanation generation as a constrained optimization problem, solved with ADMM to accommodate hard sparsity (ell0/ell1) and range constraints along with total variation regularization. Empirical results on ROP, MNIST, and FMNIST show SSplain outperforms nine baselines in post-hoc accuracy and aligns with domain-relevant cues such as vessel tortuosity and dilation, while maintaining sturdiness under sanity checks. The approach offers a practical pathway to domain-aware, reliable explanations that can generalize across image domains via adaptable sparsity and smoothness constraints.
Abstract
Neural networks are frequently used in medical diagnosis. However, due to their black-box nature, model explainers are used to help clinicians understand better and trust model outputs. This paper introduces an explainer method for classifying Retinopathy of Prematurity (ROP) from fundus images. Previous methods fail to generate explanations that preserve input image structures such as smoothness and sparsity. We introduce Sparse and Smooth Explainer (SSplain), a method that generates pixel-wise explanations while preserving image structures by enforcing smoothness and sparsity. This results in realistic explanations to enhance the understanding of the given black-box model. To achieve this goal, we define an optimization problem with combinatorial constraints and solve it using the Alternating Direction Method of Multipliers (ADMM). Experimental results show that SSplain outperforms commonly used explainers in terms of both post-hoc accuracy and smoothness analyses. Additionally, SSplain identifies features that are consistent with domain-understandable features that clinicians consider as discriminative factors for ROP. We also show SSplain's generalization by applying it to additional publicly available datasets. Code is available at https://github.com/neu-spiral/SSplain.
