Resampled Datasets Are Not Enough: Mitigating Societal Bias Beyond Single Attributes
Yusuke Hirota, Jerone T. A. Andrews, Dora Zhao, Orestis Papakyriakopoulos, Apostolos Modas, Yuta Nakashima, Alice Xiang
TL;DR
The paper tackles societal bias in image-text datasets by decorrelating protected groups from all image attributes through synthetic counterfactual data generated by text-guided inpainting. It introduces data filtering to mitigate biases arising from inpainting and demonstrates substantial bias reduction on multi-label classification and image captioning tasks without sacrificing performance. Evaluations across COCO and OpenImages with various backbones show improved leakage and bias ratio metrics, while preserving METEOR and CLIPScore-based caption quality. The work also discusses ethical considerations, limitations of real-vs-synthetic data augmentation, and directions for extending the framework to additional protected groups.
Abstract
We tackle societal bias in image-text datasets by removing spurious correlations between protected groups and image attributes. Traditional methods only target labeled attributes, ignoring biases from unlabeled ones. Using text-guided inpainting models, our approach ensures protected group independence from all attributes and mitigates inpainting biases through data filtering. Evaluations on multi-label image classification and image captioning tasks show our method effectively reduces bias without compromising performance across various models.
