SCoRe: Submodular Combinatorial Representation Learning
Anay Majee, Suraj Kothawade, Krishnateja Killamsetty, Rishabh Iyer
TL;DR
SCoRe addresses inter-class bias and intra-class variance in long-tail recognition by recasting representation learning as a set-based optimization over class-sets with submodular information measures. It introduces two core objectives, Total Information $L_{S_f}(\theta)$ and Total Correlation $L_{C_f}(\theta)$, and implements three instantiations—SCoRe-FL, SCoRe-GC, and SCoRe-LogDet—grounded in submodular functions; these formulations enable both intra-class compactness and inter-class separation while generalizing existing metric/contrastive losses. Empirically, SCoRe yields substantial gains across long-tail classification benchmarks (up to $7.6\%$) and object detection tasks (up to $19.4\%$), and demonstrates faster convergence and reduced inter-class bias compared to state-of-the-art methods. The framework’s versatility is evidenced by its ability to reproduce or improve upon SupCon, N-pairs, and OPL, and by its demonstrated applicability to large-scale, real-world imbalanced data. Overall, SCoRe provides a principled, scalable approach to robust representation learning in the presence of strong class imbalance.
Abstract
In this paper we introduce the SCoRe (Submodular Combinatorial Representation Learning) framework, a novel approach in representation learning that addresses inter-class bias and intra-class variance. SCoRe provides a new combinatorial viewpoint to representation learning, by introducing a family of loss functions based on set-based submodular information measures. We develop two novel combinatorial formulations for loss functions, using the Total Information and Total Correlation, that naturally minimize intra-class variance and inter-class bias. Several commonly used metric/contrastive learning loss functions like supervised contrastive loss, orthogonal projection loss, and N-pairs loss, are all instances of SCoRe, thereby underlining the versatility and applicability of SCoRe in a broad spectrum of learning scenarios. Novel objectives in SCoRe naturally model class-imbalance with up to 7.6\% improvement in classification on CIFAR-10-LT, CIFAR-100-LT, MedMNIST, 2.1% on ImageNet-LT, and 19.4% in object detection on IDD and LVIS (v1.0), demonstrating its effectiveness over existing approaches.
