Sanity Checks Revisited: An Exploration to Repair the Model Parameter Randomisation Test
Anna Hedström, Leander Weber, Sebastian Lapuschkin, Marina MC Höhne
TL;DR
The paper revisits the Model Parameter Randomisation Test (MPRT) for evaluating explanation faithfulness in XAI and identifies confounds such as top-down randomisation preserving information and noise-sensitive similarity metrics. It introduces sMPRT and eMPRT as remedies: sMPRT denoises attributions via sampling/smoothing, and eMPRT replaces pairwise similarity with an entropy-based attribution complexity measure, quantified through the rate of change under full randomisation. The methods are formalized with definitions for $\\Psi_{MPRT}$, $\\Psi_{sMPRT}$, and $\\Psi_{eMPRT}$ and use $H(\\Phi(\\cdot))$ to capture attribution complexity (with $n=100$ bins) to enable scalable, architecture-agnostic evaluation. Meta-evaluation on diverse datasets and architectures suggests improved reliability and efficiency, providing a more trustworthy framework for comparing XAI explanations in practice by focusing on sensitivity to parameter perturbations rather than noisy similarity metrics.
Abstract
The Model Parameter Randomisation Test (MPRT) is widely acknowledged in the eXplainable Artificial Intelligence (XAI) community for its well-motivated evaluative principle: that the explanation function should be sensitive to changes in the parameters of the model function. However, recent works have identified several methodological caveats for the empirical interpretation of MPRT. To address these caveats, we introduce two adaptations to the original MPRT -- Smooth MPRT and Efficient MPRT, where the former minimises the impact that noise has on the evaluation results through sampling and the latter circumvents the need for biased similarity measurements by re-interpreting the test through the explanation's rise in complexity, after full parameter randomisation. Our experimental results demonstrate that these proposed variants lead to improved metric reliability, thus enabling a more trustworthy application of XAI methods.
