Fair Recommendations with Limited Sensitive Attributes: A Distributionally Robust Optimization Approach
Tianhao Shi, Yang Zhang, Jizhi Zhang, Fuli Feng, Xiangnan He
TL;DR
This work tackles fairness in recommender systems when only a subset of sensitive attributes is available, a setting motivated by privacy and legal constraints. It introduces Distributionally Robust Fair Optimization (DRFO), which hedges against reconstruction errors by minimizing worst-case unfairness over an ambiguity set of distributions around reconstructed attributes. The method combines building a TV-distance-based ambiguity set from reconstruction errors with a DRO objective that enforces fairness across all distributions in the set, showing theoretical robustness and empirical gains. Across MovieLens-1M and Tenrec, DRFO outperforms reconstruction-based baselines, particularly as the proportion of unknown sensitive attributes grows, while incurring only modest losses in recommendation accuracy. The approach also accommodates users who refuse reconstruction by using broad ambiguity sets, preserving fairness in real-world, privacy-conscious scenarios.
Abstract
As recommender systems are indispensable in various domains such as job searching and e-commerce, providing equitable recommendations to users with different sensitive attributes becomes an imperative requirement. Prior approaches for enhancing fairness in recommender systems presume the availability of all sensitive attributes, which can be difficult to obtain due to privacy concerns or inadequate means of capturing these attributes. In practice, the efficacy of these approaches is limited, pushing us to investigate ways of promoting fairness with limited sensitive attribute information. Toward this goal, it is important to reconstruct missing sensitive attributes. Nevertheless, reconstruction errors are inevitable due to the complexity of real-world sensitive attribute reconstruction problems and legal regulations. Thus, we pursue fair learning methods that are robust to reconstruction errors. To this end, we propose Distributionally Robust Fair Optimization (DRFO), which minimizes the worst-case unfairness over all potential probability distributions of missing sensitive attributes instead of the reconstructed one to account for the impact of the reconstruction errors. We provide theoretical and empirical evidence to demonstrate that our method can effectively ensure fairness in recommender systems when only limited sensitive attributes are accessible.
