EVO-LRP: Evolutionary Optimization of LRP for Interpretable Model Explanations
Emerald Zhang, Julian Weaver, Samantha R Santacruz, Edward Castillo
TL;DR
EVO-LRP tackles the challenge of producing interpretable explanations for deep networks by optimizing Layer-wise Relevance Propagation (LRP) rules with Covariance Matrix Adaptation Evolution Strategy (CMA-ES). By directly tuning LRP hyperparameters against objective interpretability metrics ($FC$, $SP$, $AS$), it yields explanations that are more faithful, concise, and robust, and that adapt to specific target classes. The framework demonstrates that principled, task-aligned optimization can systematically improve attribution quality and enable class-discriminative heatmaps, including complementary composite maps. Empirically, EVO-LRP outperforms standard baselines on ImageNet with clear qualitative gains in the coherence and discriminative power of class-specific relevance maps, suggesting practical impact for high-stakes domains requiring transparent decision-making.
Abstract
Explainable AI (XAI) methods help identify which image regions influence a model's prediction, but often face a trade-off between detail and interpretability. Layer-wise Relevance Propagation (LRP) offers a model-aware alternative. However, LRP implementations commonly rely on heuristic rule sets that are not optimized for clarity or alignment with model behavior. We introduce EVO-LRP, a method that applies Covariance Matrix Adaptation Evolution Strategy (CMA-ES) to tune LRP hyperparameters based on quantitative interpretability metrics, such as faithfulness or sparseness. EVO-LRP outperforms traditional XAI approaches in both interpretability metric performance and visual coherence, with strong sensitivity to class-specific features. These findings demonstrate that attribution quality can be systematically improved through principled, task-specific optimization.
