Relevance-driven Input Dropout: an Explanation-guided Regularization Technique
Shreyas Gururaj, Lars Grüne, Wojciech Samek, Sebastian Lapuschkin, Leander Weber
TL;DR
This work tackles overfitting in data-scarce settings by introducing RelDrop, an explanation-guided input regularization that occludes the most attribution-sensitive input regions to force the model to leverage a broader set of features. By computing attribution maps and applying a balanced mask to inputs in both 2D images and 3D point clouds, RelDrop acts as an informed regularizer requiring only one extra backward pass per batch. Empirically, RelDrop improves generalization and robustness to occlusion across CIFAR, ImageNet, ModelNet40, and ShapeNet, with notable gains in zero-shot performance on distribution-shifted ImageNet variants and qualitative evidence of more distributed, semantically meaningful feature usage. The method offers a practical pathway to tighter generalization by integrating attribution signals into data augmentation and can be extended to other architectures and modalities in future work.
Abstract
Overfitting is a well-known issue extending even to state-of-the-art (SOTA) Machine Learning (ML) models, resulting in reduced generalization, and a significant train-test performance gap. Mitigation measures include a combination of dropout, data augmentation, weight decay, and other regularization techniques. Among the various data augmentation strategies, occlusion is a prominent technique that typically focuses on randomly masking regions of the input during training. Most of the existing literature emphasizes randomness in selecting and modifying the input features instead of regions that strongly influence model decisions. We propose Relevance-driven Input Dropout (RelDrop), a novel data augmentation method which selectively occludes the most relevant regions of the input, nudging the model to use other important features in the prediction process, thus improving model generalization through informed regularization. We further conduct qualitative and quantitative analyses to study how Relevance-driven Input Dropout (RelDrop) affects model decision-making. Through a series of experiments on benchmark datasets, we demonstrate that our approach improves robustness towards occlusion, results in models utilizing more features within the region of interest, and boosts inference time generalization performance. Our code is available at https://github.com/Shreyas-Gururaj/LRP_Relevance_Dropout.
