Axiomatization of Gradient Smoothing in Neural Networks
Linjiang Zhou, Xiaochuan Shi, Chao Ma, Zepeng Wang
TL;DR
This work addresses the lack of theoretical grounding for gradient smoothing in neural networks by formulating Monte Carlo Gradient Mollification, anchored in function mollification and Monte Carlo integration. It shows that SmoothGrad, NoiseGrad, and FusionGrad are special cases within a unified axiomatic framework and introduces new kernel-based smoothing methods, guided by convergence and hyperparameter analysis. The authors provide mathematical proofs of unbiasedness and consistency for the Monte Carlo estimator and offer practical guidance on kernel design and parameter settings. Experimental results across multiple datasets and models demonstrate how kernel choice and smoothing mode affect explainability metrics, underscoring the framework's potential to improve gradient-based explanations with principled methods.
Abstract
Gradients play a pivotal role in neural networks explanation. The inherent high dimensionality and structural complexity of neural networks result in the original gradients containing a significant amount of noise. While several approaches were proposed to reduce noise with smoothing, there is little discussion of the rationale behind smoothing gradients in neural networks. In this work, we proposed a gradient smooth theoretical framework for neural networks based on the function mollification and Monte Carlo integration. The framework intrinsically axiomatized gradient smoothing and reveals the rationale of existing methods. Furthermore, we provided an approach to design new smooth methods derived from the framework. By experimental measurement of several newly designed smooth methods, we demonstrated the research potential of our framework.
