Beyond Softmax: Dual-Branch Sigmoid Architecture for Accurate Class Activation Maps

Yoojin Oh; Junhyug Noh

TL;DR

This work identifies two fundamental distortions that a final softmax classifier introduces into CAM explanations, additive logit shifts and sign collapse, both of which can mislead localization. It proposes a simple, architecture-agnostic fix: a dual-branch sigmoid head. The classification head of a pretrained model is cloned into a parallel branch with per-class sigmoid outputs, the original softmax head is frozen, and only the sigmoid branch is fine-tuned with class-balanced binary supervision. At inference, the softmax head still produces the prediction, so recognition accuracy is unchanged, while the sigmoid branch generates heatmaps that preserve both the magnitude and the sign of feature contributions. The approach is compatible with most existing CAM variants and adds negligible overhead. Experiments on fine-grained datasets (CUB-200-2011, Stanford Cars) and WSOL benchmarks (ImageNet-1K, OpenImages30K) show improved explanation fidelity and consistent Top-1 localization gains without any drop in classification accuracy.
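As a small illustration of why additive logit shifts matter, the sketch below uses the standard property that softmax is unchanged when the same constant is added to every logit, whereas per-class sigmoid outputs are not. The logit values and the shift of 10.0 are arbitrary assumptions chosen for the example, and the paper's own formulation of the distortion may be more specific than this.

```python
import math

def softmax(logits):
    # Subtract the max for numerical stability; this does not change the result.
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

# Hypothetical logits for three classes, and the same logits shifted by a constant.
logits = [2.0, -1.0, 0.5]
shifted = [z + 10.0 for z in logits]

p = softmax(logits)
p_shifted = softmax(shifted)           # matches p: softmax ignores the shift

s = [sigmoid(z) for z in logits]
s_shifted = [sigmoid(z) for z in shifted]  # differs from s: sigmoid sees the shift
```

Because the softmax output is blind to the shared offset, a softmax-trained head leaves that offset unconstrained, which is one way to read the claim that importance scores derived from its logits can be arbitrarily biased.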

Abstract

Class Activation Mapping (CAM) and its extensions have become indispensable tools for visualizing the evidence behind deep network predictions. However, by relying on a final softmax classifier, these methods suffer from two fundamental distortions: additive logit shifts that arbitrarily bias importance scores, and sign collapse that conflates excitatory and inhibitory features. We propose a simple, architecture-agnostic dual-branch sigmoid head that decouples localization from classification. Given any pretrained model, we clone its classification head into a parallel branch ending in per-class sigmoid outputs, freeze the original softmax head, and fine-tune only the sigmoid branch with class-balanced binary supervision. At inference, softmax retains recognition accuracy, while class evidence maps are generated from the sigmoid branch -- preserving both magnitude and sign of feature contributions. Our method integrates seamlessly with most CAM variants and incurs negligible overhead. Extensive evaluations on fine-grained tasks (CUB-200-2011, Stanford Cars) and WSOL benchmarks (ImageNet-1K, OpenImages30K) show improved explanation fidelity and consistent Top-1 Localization gains -- without any drop in classification accuracy. Code is available at https://github.com/finallyupper/beyond-softmax.
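The procedure described above can be sketched in a few lines. This is a minimal toy version under stated assumptions, not the authors' implementation: the head is assumed to be global average pooling followed by one linear layer, plain per-class binary cross-entropy stands in for the class-balanced supervision, and all names (`softmax_head`, `sigmoid_head`, `bce_step`, `cam`) and numbers are hypothetical.

```python
import copy
import math

def gap(features):
    # Global average pooling: K channels of H x W -> K floats.
    return [sum(sum(row) for row in ch) / (len(ch) * len(ch[0])) for ch in features]

def logits(head, pooled):
    return [sum(w * x for w, x in zip(row, pooled)) + b
            for row, b in zip(head["W"], head["b"])]

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def bce_step(head, pooled, label, lr=0.1):
    # One SGD step of per-class binary cross-entropy with a one-hot target.
    # Assumption: unweighted BCE, not the class-balanced variant.
    z = logits(head, pooled)
    for c, z_c in enumerate(z):
        target = 1.0 if c == label else 0.0
        grad = sigmoid(z_c) - target  # derivative of BCE w.r.t. logit c
        head["W"][c] = [w - lr * grad * x for w, x in zip(head["W"][c], pooled)]
        head["b"][c] -= lr * grad

def cam(head, features, c):
    # Signed class evidence map: weighted sum of feature channels, no clipping.
    K, H, Wd = len(features), len(features[0]), len(features[0][0])
    return [[sum(head["W"][c][k] * features[k][i][j] for k in range(K))
             for j in range(Wd)] for i in range(H)]

# Toy backbone output: 2 channels on a 2 x 2 grid (hypothetical values).
features = [[[1.0, 0.0], [0.0, 0.0]],
            [[0.0, 0.0], [0.0, 2.0]]]

softmax_head = {"W": [[0.5, -0.5], [-0.5, 0.5]], "b": [0.0, 0.0]}  # frozen
sigmoid_head = copy.deepcopy(softmax_head)                          # cloned branch

pooled = gap(features)
for _ in range(20):
    bce_step(sigmoid_head, pooled, label=1)  # only the sigmoid branch is updated

# Inference: the prediction comes from the frozen head, the map from the sigmoid branch.
z = logits(softmax_head, pooled)
prediction = max(range(len(z)), key=z.__getitem__)
heatmap = cam(sigmoid_head, features, prediction)
```

In this toy run the frozen head's weights are never modified, so its prediction is unchanged, while the resulting map keeps a negative value where the first channel fires and a positive value where the second one does.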
