Balancing Tails when Comparing Distributions: Comprehensive Equity Index (CEI) with Application to Bias Evaluation in Operational Face Biometrics
Imanol Solano, Julian Fierrez, Aythami Morales, Alejandro Peña, Ruben Tolosana, Francisco Zamora-Martinez, Javier San Agustin
TL;DR
This work tackles bias in high-performance face recognition by introducing the Comprehensive Equity Index (CEI), a tail-aware, threshold-agnostic metric that analyzes genuine and impostor score distributions separately. CEI splits each distribution at a percentile to emphasize distribution tails and uses KL divergence to compare to a mean distribution, with CEIN and CEIE variants; an automated CEIA version further selects splitting percentiles and tail weights. Through synthetic and real-world experiments across diverse datasets and biased training conditions, CEI and CEIA demonstrate superior sensitivity to subtle demographic biases in both tail regions and overall shapes, outperforming prior distribution-based metrics. The approach provides a practically useful, adaptable fairness tool that complements traditional FR metrics and can be extended to other domains involving tail analysis of statistical distributions.
Abstract
Demographic bias in high-performance face recognition (FR) systems often eludes detection by existing metrics, especially with respect to subtle disparities in the tails of the score distribution. We introduce the Comprehensive Equity Index (CEI), a novel metric designed to address this limitation. CEI uniquely analyzes genuine and impostor score distributions separately, enabling a configurable focus on tail probabilities while also considering overall distribution shapes. Our extensive experiments (evaluating state-of-the-art FR systems, intentionally biased models, and diverse datasets) confirm CEI's superior ability to detect nuanced biases where previous methods fall short. Furthermore, we present CEI^A, an automated version of the metric that enhances objectivity and simplifies practical application. CEI provides a robust and sensitive tool for operational FR fairness assessment. The proposed methods have been developed particularly for bias evaluation in face biometrics but, in general, they are applicable for comparing statistical distributions in any problem where one is interested in analyzing the distribution tails.
