ConceptScope: Characterizing Dataset Bias via Disentangled Visual Concepts

Jinho Choi; Hyesu Lim; Steffen Schneider; Jaegul Choo

ConceptScope: Characterizing Dataset Bias via Disentangled Visual Concepts

Jinho Choi, Hyesu Lim, Steffen Schneider, Jaegul Choo

TL;DR

ConceptScope presents an automated pipeline to audit visual datasets by extracting a large, interpretable dictionary of visual concepts via Sparse Autoencoders trained on vision foundation-model representations. Concepts are categorized per class into target, context, and bias using semantic alignment and concept-strength metrics, enabling class-level bias analysis and robustness evaluation. The approach yields accurate concept prediction and localization, detects known biases, uncovers novel biases in real-world data, and provides a framework for diagnosing model robustness under concept distribution shifts without external OOD data. By equipping practitioners with automatic bias discovery and region-specific bias localization, ConceptScope offers a practical tool for dataset auditing, bias mitigation, and model diagnostics in real-world vision tasks.

Abstract

Dataset bias, where data points are skewed to certain concepts, is ubiquitous in machine learning datasets. Yet, systematically identifying these biases is challenging without costly, fine-grained attribute annotations. We present ConceptScope, a scalable and automated framework for analyzing visual datasets by discovering and quantifying human-interpretable concepts using Sparse Autoencoders trained on representations from vision foundation models. ConceptScope categorizes concepts into target, context, and bias types based on their semantic relevance and statistical correlation to class labels, enabling class-level dataset characterization, bias identification, and robustness evaluation through concept-based subgrouping. We validate that ConceptScope captures a wide range of visual concepts, including objects, textures, backgrounds, facial attributes, emotions, and actions, through comparisons with annotated datasets. Furthermore, we show that concept activations produce spatial attributions that align with semantically meaningful image regions. ConceptScope reliably detects known biases (e.g., background bias in Waterbirds) and uncovers previously unannotated ones (e.g, co-occurring objects in ImageNet), offering a practical tool for dataset auditing and model diagnostics.

ConceptScope: Characterizing Dataset Bias via Disentangled Visual Concepts

TL;DR

Abstract

ConceptScope: Characterizing Dataset Bias via Disentangled Visual Concepts

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (13)