Reduction of Class Activation Uncertainty with Background Information
H M Dipu Kabir
TL;DR
This work introduces a background class to reduce class activation uncertainty and improve generalization with lower computational cost than multitask learning. By generating diverse, non-target background images and training with an extra background output, the method shifts focus in the head layer toward robust, widely-activated features, as analyzed through CAM. Across varied datasets and architectures, including Vision Transformers, the approach yields improved or competitive accuracy with reduced training overhead and shows SOTA or near-SOTA results on several benchmarks. The study also discusses background-class generation principles, ablation results, and practical considerations, framing future work in uncertainty scores and broader applications.
Abstract
Multitask learning is a popular approach to training high-performing neural networks with improved generalization. In this paper, we propose a background class to achieve improved generalization at a lower computation compared to multitask learning to help researchers and organizations with limited computation power. We also present a methodology for selecting background images and discuss potential future improvements. We apply our approach to several datasets and achieve improved generalization with much lower computation. Through the class activation mappings (CAMs) of the trained models, we observed the tendency towards looking at a bigger picture with the proposed model training methodology. Applying the vision transformer with the proposed background class, we receive state-of-the-art (SOTA) performance on CIFAR-10C, Caltech-101, and CINIC-10 datasets. Example scripts are available in the `CAM' folder of the following GitHub Repository: github.com/dipuk0506/UQ
