SemCovNet: Towards Fair and Semantic Coverage-Aware Learning for Underrepresented Visual Concepts
Sakib Ahammed, Xia Cui, Xinqi Fan, Wenqi Lu, Moi Hoon Yap
TL;DR
SemCovNet tackles Semantic Coverage Imbalance (SCI), a descriptor-level fairness issue where underrepresented semantic concepts co-occur with inconsistent coverage across classes and subgroups. It introduces a Semantic Descriptor Map (SDM), Descriptor Attention Modulation (DAM), and Descriptor–Visual Alignment (DVA), combined with a Coverage Disparity Index (CDI) to regularize and align semantic coverage with model error. Empirical results on MILK10k and ISIC-DICM-17K show substantial reductions in CDI and improved tail performance, while maintaining calibration and enabling cross-domain generalization to CelebA. The work formalizes semantic fairness, demonstrates a closed-loop learning approach, and provides a foundation for interpretable, fair vision learning in both medical and non-medical domains.
Abstract
Modern vision models increasingly rely on rich semantic representations that extend beyond class labels to include descriptive concepts and contextual attributes. However, existing datasets exhibit Semantic Coverage Imbalance (SCI), a previously overlooked bias arising from the long-tailed semantic representations. Unlike class imbalance, SCI occurs at the semantic level, affecting how models learn and reason about rare yet meaningful semantics. To mitigate SCI, we propose Semantic Coverage-Aware Network (SemCovNet), a novel model that explicitly learns to correct semantic coverage disparities. SemCovNet integrates a Semantic Descriptor Map (SDM) for learning semantic representations, a Descriptor Attention Modulation (DAM) module that dynamically weights visual and concept features, and a Descriptor-Visual Alignment (DVA) loss that aligns visual features with descriptor semantics. We quantify semantic fairness using a Coverage Disparity Index (CDI), which measures the alignment between coverage and error. Extensive experiments across multiple datasets demonstrate that SemCovNet enhances model reliability and substantially reduces CDI, achieving fairer and more equitable performance. This work establishes SCI as a measurable and correctable bias, providing a foundation for advancing semantic fairness and interpretable vision learning.
