Synthetic Counterfactual Faces
Guruprasad V Ramesh, Harrison Rosenberg, Ashish Hooda, Shimaa Ahmed Kassem Fawaz
TL;DR
This work tackles the challenge of evaluating fairness and robustness of face understanding systems under semantic distribution shifts by generating targeted counterfactual faces. It introduces an end-to-end diffusion-based pipeline that creates source faces and applies semantic edits via a latent manipulator, with filtering guided by distortion and attribute detectors. The authors generate about 15k counterfactual faces across 8 demographic groups and 19 attributes, validate the data with human studies, and demonstrate a practical vulnerability in a leading commercial model (Instagram Android Image Understanding). The study provides a controllable, validated dataset and methodology for targeted bias analysis and robustness evaluation in real-world vision systems.
Abstract
Computer vision systems have been deployed in various applications involving biometrics like human faces. These systems can identify social media users, search for missing persons, and verify identity of individuals. While computer vision models are often evaluated for accuracy on available benchmarks, more annotated data is necessary to learn about their robustness and fairness against semantic distributional shifts in input data, especially in face data. Among annotated data, counterfactual examples grant strong explainability characteristics. Because collecting natural face data is prohibitively expensive, we put forth a generative AI-based framework to construct targeted, counterfactual, high-quality synthetic face data. Our synthetic data pipeline has many use cases, including face recognition systems sensitivity evaluations and image understanding system probes. The pipeline is validated with multiple user studies. We showcase the efficacy of our face generation pipeline on a leading commercial vision model. We identify facial attributes that cause vision systems to fail.
