A Fresh Look at Sanity Checks for Saliency Maps
Anna Hedström, Leander Weber, Sebastian Lapuschkin, Marina Höhne
TL;DR
This paper reevaluates the Model Parameter Randomisation Test (MPRT) for saliency-map explanations and identifies key methodological weaknesses related to preprocessing, layer-order, and similarity measures. It introduces two enhancements, Smooth MPRT (sMPRT) and Efficient MPRT (eMPRT), which respectively denoise attribution signals via input perturbations and quantify explanation faithfulness through increases in complexity measured by histogram entropy, thereby avoiding biased similarity metrics. Through extensive experiments on ImageNet, MNIST, and fMNIST with multiple architectures and attribution methods, the study demonstrates that both sMPRT and eMPRT improve metric reliability over the original MPRT, though no metric achieves perfect reliability and rankings can vary across tasks. The work provides a practical, publicly available toolkit for robust XAI evaluation and highlights the importance of using multiple, complementary metrics to assess attribution quality in real-world applications.
Abstract
The Model Parameter Randomisation Test (MPRT) is highly recognised in the eXplainable Artificial Intelligence (XAI) community due to its fundamental evaluative criterion: explanations should be sensitive to the parameters of the model they seek to explain. However, recent studies have raised several methodological concerns for the empirical interpretation of MPRT. In response, we propose two modifications to the original test: Smooth MPRT and Efficient MPRT. The former reduces the impact of noise on evaluation outcomes via sampling, while the latter avoids the need for biased similarity measurements by re-interpreting the test through the increase in explanation complexity after full model randomisation. Our experiments show that these modifications enhance the metric reliability, facilitating a more trustworthy deployment of explanation methods.
