Frequency-Aware Model Parameter Explorer: A new attribution method for improving explainability
Ali Yavari, Alireza Mohamadi, Elham Beydaghi, Rainer A. Leitgeb
TL;DR
This work tackles explainability of DNNs under real-world perturbations by introducing transferable frequency-aware adversarial attacks and the FAMPE attribution method. By decoupling high- and low-frequency perturbations with an alpha-controlled mix and an energy-based cutoff, FAMPE produces more precise attribution maps than prior methods, notably AttEXplore. Across ImageNet experiments on multiple architectures, FAMPE achieves substantial improvements in Insertion Score and demonstrates the value of incorporating frequency-domain perturbations into explainability. The authors also provide ablations to reveal the contributions of each frequency band and plan to release the code for reproducibility and further research.
Abstract
Ensuring the reliability of deep neural networks (DNNs) in the presence of real world noise and intentional perturbations remains a significant challenge. To address this, attribution methods have been proposed, though their efficacy remains suboptimal and necessitates further refinement. In this paper, we propose a novel category of transferable adversarial attacks, called transferable frequency-aware attacks, enabling frequency-aware exploration via both high-and low-frequency components. Based on this type of attacks, we also propose a novel attribution method, named Frequency-Aware Model Parameter Explorer (FAMPE), which improves the explainability for DNNs. Relative to the current state-of-the-art method AttEXplore, our FAMPE attains an average gain of 13.02% in Insertion Score, thereby outperforming existing approaches. Through detailed ablation studies, we also investigate the role of both high- and low-frequency components in explainability.
