You Only Debias Once: Towards Flexible Accuracy-Fairness Trade-offs at Inference Time

Xiaotian Han; Tianlong Chen; Kaixiong Zhou; Zhimeng Jiang; Zhangyang Wang; Xia Hu

You Only Debias Once: Towards Flexible Accuracy-Fairness Trade-offs at Inference Time

Xiaotian Han, Tianlong Chen, Kaixiong Zhou, Zhimeng Jiang, Zhangyang Wang, Xia Hu

TL;DR

This work tackles the problem of fixed accuracy-fairness trade-offs in fairness methods for high-stakes decisions. It introduces You Only Debias Once (YODO), a method that learns an objective-diverse subspace forming a line in weight space between an accuracy-optimum endpoint $\omega_1$ and a fairness-optimum endpoint $\omega_2$, parameterized by $\theta = (1-\alpha)\omega_1 + \alpha\omega_2$ for inference. By training with $\alpha \sim \mathrm{U}[0,1]$ and incorporating a cosine diversity regularizer, YODO enables arbitrary accuracy-fairness trade-offs from a single model without retraining, while supporting multiple fairness notions such as DP, EO, and Eodd. Empirical results on tabular and image data show competitive or superior Pareto fronts and smoother trade-off curves, with interpretable instance-level adjustments, demonstrating practical applicability and low overhead for real-world deployments.

Abstract

Deep neural networks are prone to various bias issues, jeopardizing their applications for high-stake decision-making. Existing fairness methods typically offer a fixed accuracy-fairness trade-off, since the weight of the well-trained model is a fixed point (fairness-optimum) in the weight space. Nevertheless, more flexible accuracy-fairness trade-offs at inference time are practically desired since: 1) stakes of the same downstream task can vary for different individuals, and 2) different regions have diverse laws or regularization for fairness. If using the previous fairness methods, we have to train multiple models, each offering a specific level of accuracy-fairness trade-off. This is often computationally expensive, time-consuming, and difficult to deploy, making it less practical for real-world applications. To address this problem, we propose You Only Debias Once (YODO) to achieve in-situ flexible accuracy-fairness trade-offs at inference time, using a single model that trained only once. Instead of pursuing one individual fixed point (fairness-optimum) in the weight space, we aim to find a "line" in the weight space that connects the accuracy-optimum and fairness-optimum points using a single model. Points (models) on this line implement varying levels of accuracy-fairness trade-offs. At inference time, by manually selecting the specific position of the learned "line", our proposed method can achieve arbitrary accuracy-fairness trade-offs for different end-users and scenarios. Experimental results on tabular and image datasets show that YODO achieves flexible trade-offs between model accuracy and fairness, at ultra-low overheads. For example, if we need $100$ levels of trade-off on the \acse dataset, YODO takes $3.53$ seconds while training $100$ fixed models consumes $425$ seconds. The code is available at https://github.com/ahxt/yodo.

You Only Debias Once: Towards Flexible Accuracy-Fairness Trade-offs at Inference Time

TL;DR

Abstract

You Only Debias Once: Towards Flexible Accuracy-Fairness Trade-offs at Inference Time

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (26)