A Rapid Test for Accuracy and Bias of Face Recognition Technology

Manuel Knott; Ignacio Serna; Ethan Mann; Pietro Perona

A Rapid Test for Accuracy and Bias of Face Recognition Technology

Manuel Knott, Ignacio Serna, Ethan Mann, Pietro Perona

TL;DR

This work tackles the challenge of accurately and affordably benchmarking 1:1 face verification across cloud FR services by eliminating manual ground-truth labeling. It introduces a rapid, annotation-free pipeline that sources recent web images for two datasets, infers ground-truth identities from ensemble embeddings via spectral factorization, and aggregates results across five cloud services to produce $FNMR$ vs. $FMR$ curves and bias metrics. Key contributions include an automated validation against hand-annotated labels, the first public multi-service benchmark of FR accuracy and demographic bias, and a framework that highlights biases such as poorer performance for Asian women in certain services. The approach promises to democratize FR testing, enabling rapid, scalable, privacy-conscious evaluation that informs developers, policymakers, and the public about accuracy and fairness in real-world deployments.

Abstract

Measuring the accuracy of face recognition (FR) systems is essential for improving performance and ensuring responsible use. Accuracy is typically estimated using large annotated datasets, which are costly and difficult to obtain. We propose a novel method for 1:1 face verification that benchmarks FR systems quickly and without manual annotation, starting from approximate labels (e.g., from web search results). Unlike previous methods for training set label cleaning, ours leverages the embedding representation of the models being evaluated, achieving high accuracy in smaller-sized test datasets. Our approach reliably estimates FR accuracy and ranking, significantly reducing the time and cost of manual labeling. We also introduce the first public benchmark of five FR cloud services, revealing demographic biases, particularly lower accuracy for Asian women. Our rapid test method can democratize FR testing, promoting scrutiny and responsible use of the technology. Our method is provided as a publicly accessible tool at https://github.com/caltechvisionlab/frt-rapid-test

A Rapid Test for Accuracy and Bias of Face Recognition Technology

TL;DR

Abstract

A Rapid Test for Accuracy and Bias of Face Recognition Technology

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (18)