Configurable Fairness: Direct Optimization of Parity Metrics via Vision-Language Models
Miao Zhang, Rumi Chunara
TL;DR
The paper tackles fairness gaps in image recognition arising from imbalanced groups and unknown sensitive attributes. It proposes a configurable fairness framework that directly optimizes parity-based metrics using vision-language model derived attribute relevancy as soft regularizers, enabling equality-of-opportunity, equalized odds, or accuracy parity without group labels. The approach yields differentiable loss terms that connect to each metric via PCC-based penalties, and experiments across CelebA, UTKFace, and Dogs & Cats show improved parity with robust performance under noisy attribute signals. This method offers a flexible, privacy-preserving path to fairness in vision tasks and highlights the ongoing trade-offs between different parity notions in real-world deployments.
Abstract
Performance disparities of image recognition across demographic groups are known to exist in deep learning-based models, due to imbalanced group representations or spurious correlation between group and target labels. Previous work has addressed such challenges without relying on expensive group labels, typically by upweighting high-loss samples or balancing discovered clusters. However, these heuristic strategies lack direct connection to specific fairness metrics and cannot guarantee optimization of parity-based criteria like equal opportunity, which ensures equal chance to receive positive outcomes across groups. In this work, we propose a novel paradigm that directly optimizes parity-based fairness metrics through specifically designed training objectives, without requiring group labels. We leverage vision-language models to analyze sensitive attribute relevancy for individual samples, then formulate loss functions that mathematically connect to each target fairness metric. This enables flexible optimization of different fairness criteria based on application needs. Experiments on multiple image classification datasets show that our metric-specific approach significantly improves parity-based fairness criteria and outperforms existing methods.
