Fairness for the People, by the People: Minority Collective Action
Omri Ben-Dov, Samira Samadi, Amartya Sanyal, Alexandru Ţifrea
TL;DR
The paper addresses fairness in ML by shifting focus from firm-side interventions to Algorithmic Collective Action (ACA), where a minority group relabels their own data to influence the learned model without altering the training pipeline. It introduces three model-agnostic methods (RB-label, RB-dist, RB-prob) to approximate counterfactual labels and demonstrates, across diverse datasets, that a small minority can substantially reduce fairness violations (e.g., EqOd) with limited accuracy loss. The work connects ACA to counterfactual and fair representation fairness, provides theoretical bounds on success and estimation error, and analyzes practical limitations when the minority is isolated. Overall, it shows a feasible, user-driven fairness mechanism with meaningful improvements and clear boundaries, suggesting directions for safer and more scalable deployment.
Abstract
Machine learning models often preserve biases present in training data, leading to unfair treatment of certain minority groups. Despite an array of existing firm-side bias mitigation techniques, they typically incur utility costs and require organizational buy-in. Recognizing that many models rely on user-contributed data, end-users can induce fairness through the framework of Algorithmic Collective Action, where a coordinated minority group strategically relabels its own data to enhance fairness, without altering the firm's training process. We propose three practical, model-agnostic methods to approximate ideal relabeling and validate them on real-world datasets. Our findings show that a subgroup of the minority can substantially reduce unfairness with a small impact on the overall prediction error.
