Reflective-Net: Learning from Explanations
Johannes Schneider, Michalis Vlachos
TL;DR
Reflective-Net introduces a self-reflective learning paradigm that uses explanations from explainability methods to augment training data and perform a brief second, deliberative pass during inference. By extending GradCAM to intermediate layers and concatenating multi-channel explanations with inputs, the method enables learning from multiple hypothetical outcomes (correct, predicted, random) and from different explanation sources. Empirical results across CIFAR, SVHN, FashionMNIST, and TinyImageNet show improved accuracy and faster early convergence compared to non-reflective baselines, with the strongest gains when explanations for the correct class are used at test time and when training leverages multiple explanations. The work highlights both the potential and the computational trade-offs of explanation-based self-reflection, and it opens avenues for data augmentation and alternative integration mechanisms like attention-based designs.
Abstract
We examine whether data generated by explanation techniques, which promote a process of self-reflection, can improve classifier performance. Our work is based on the idea that humans have the ability to make quick, intuitive decisions as well as to reflect on their own thinking and learn from explanations. To the best of our knowledge, this is the first time that the potential of mimicking this process by using explanations generated by explainability methods has been explored. We found that combining explanations with traditional labeled data leads to significant improvements in classification accuracy and training efficiency across multiple image classification datasets and convolutional neural network architectures. It is worth noting that during training, we not only used explanations for the correct or predicted class, but also for other classes. This serves multiple purposes, including allowing for reflection on potential outcomes and enriching the data through augmentation.
