FairSync: Ensuring Amortized Group Exposure in Distributed Recommendation Retrieval
Chen Xu, Jun Xu, Yiming Ding, Xiao Zhang, Qi Qi
TL;DR
The paper tackles the problem of ensuring amortized group max-min fairness in the retrieval stage of distributed recommender systems, a prerequisite for reliable downstream exposure in stage-2. It introduces FairSync, a dual-space distributed optimization framework in which a central vector $\boldsymbol{\mu}$ aggregates historical fairness signals and guides per-server retrieval through dual-space queries, updated periodically by gradient-based optimization. The approach builds on a distributed dense retrieval architecture and derives a low-dimensional dual formulation $W^{Dual}$, enabling scalable, online, and distributive enforcement of group exposure constraints $e_g \ge m_g$ over horizon $T$. Empirical results on two public datasets show that FairSync achieves the target exposure levels (ESP) while preserving high retrieval accuracy (Recall, NDCG, HR) across multiple base models, outperforming baseline fairness methods that either fail the exposure constraint or degrade accuracy. The work demonstrates the practical viability of incorporating amortized fairness directly into the retrieval stage and offers a scalable path for fair and efficient distributed recommender systems.
Abstract
In pursuit of fairness and balanced development, recommender systems (RS) often prioritize group fairness, ensuring that specific groups maintain a minimum level of exposure over a given period. For example, RS platforms aim to ensure adequate exposure for new providers or specific categories of items according to their needs. Modern industry RS usually adopts a two-stage pipeline: stage-1 (retrieval stage) retrieves hundreds of candidates from millions of items distributed across various servers, and stage-2 (ranking stage) focuses on presenting a small-size but accurate selection from items chosen in stage-1. Existing efforts for ensuring amortized group exposures focus on stage-2, however, stage-1 is also critical for the task. Without a high-quality set of candidates, the stage-2 ranker cannot ensure the required exposure of groups. Previous fairness-aware works designed for stage-2 typically require accessing and traversing all items. In stage-1, however, millions of items are distributively stored in servers, making it infeasible to traverse all of them. How to ensure group exposures in the distributed retrieval process is a challenging question. To address this issue, we introduce a model named FairSync, which transforms the problem into a constrained distributed optimization problem. Specifically, FairSync resolves the issue by moving it to the dual space, where a central node aggregates historical fairness data into a vector and distributes it to all servers. To trade off the efficiency and accuracy, the gradient descent technique is used to periodically update the parameter of the dual vector. The experiment results on two public recommender retrieval datasets showcased that FairSync outperformed all the baselines, achieving the desired minimum level of exposures while maintaining a high level of retrieval accuracy.
