Person-Centric Annotations of LAION-400M: Auditing Bias and Its Transfer to Models
Leander Girrbach, Stephan Alaniz, Genevieve Smith, Trevor Darrell, Zeynep Akata
TL;DR
This paper provides the first large-scale, person-centric audit of the LAION-400M dataset by annotating bounding boxes, perceived gender and race/ethnicity, and per-person captions. The authors demonstrate demographic imbalances and harmful associations in the data and show that 60-70% of gender bias observed in CLIP and Stable Diffusion can be explained by direct co-occurrences in the training data. They introduce a robust automatic labeling pipeline, validate high labeling accuracy, and perform SAE-based analyses to uncover identity-linked topics. The work establishes a principled link between pretraining data composition and downstream model bias and offers a foundation for dataset rebalancing and fairer AI systems, while acknowledging ethical and methodological limitations of perceived demographics.
Abstract
Vision-language models trained on large-scale multimodal datasets show strong demographic biases, but the role of training data in producing these biases remains unclear. A major barrier has been the lack of demographic annotations in web-scale datasets such as LAION-400M. We address this gap by creating person-centric annotations for the full dataset, including over 276 million bounding boxes, perceived gender and race/ethnicity labels, and automatically generated captions. These annotations are produced through validated automatic labeling pipelines combining object detection, multimodal captioning, and finetuned classifiers. Using them, we uncover demographic imbalances and harmful associations, such as the disproportionate linking of men and individuals perceived as Black or Middle Eastern with crime-related and negative content. We also show that 60-70% of gender bias in CLIP and Stable Diffusion can be linearly explained by direct co-occurrences in the data. Our resources establish the first large-scale empirical link between dataset composition and downstream model bias.
