The Role of Learning Algorithms in Collective Action
Omri Ben-Dov, Jake Fawkes, Samira Samadi, Amartya Sanyal
TL;DR
The paper addresses how learning algorithms influence the success of collective action in ML by moving beyond Bayes-optimal analyses to algorithm-dependent phenomena. It introduces a formal framework for planting signals within mixture distributions and analyzes two main algorithm families: Distributionally Robust Optimisation (DRO) and gradient-descent-based methods with simplicity bias (e.g., SGD). The authors prove and validate that the effective collective size $\alpha_{\text{eff}}$ and the overall success $S(\alpha)$ depend critically on algorithmic properties, showing that DRO can amplify impact for small collectives while iterative re-weighting and validation strategies can dramatically alter outcomes. They also demonstrate how algorithmic biases can be exploited to design more effective signals, with substantial empirical evidence from synthetic data, CIFAR-10, Waterbirds, and MNIST-CIFAR-inspired setups. Overall, the work highlights the necessity of considering learning algorithms when evaluating and designing collective action in ML systems, and outlines future directions for broader algorithmic classes and safety considerations.
Abstract
Collective action in machine learning is the study of the control that a coordinated group can have over machine learning algorithms. While previous research has concentrated on assessing the impact of collectives against Bayes (sub-)optimal classifiers, this perspective is limited in that it does not account for the choice of learning algorithm. Since classifiers seldom behave like Bayes classifiers and are influenced by the choice of learning algorithms along with their inherent biases, in this work we initiate the study of how the choice of the learning algorithm plays a role in the success of a collective in practical settings. Specifically, we focus on distributionally robust optimization (DRO), popular for improving a worst group error, and on the ubiquitous stochastic gradient descent (SGD), due to its inductive bias for "simpler" functions. Our empirical results, supported by a theoretical foundation, show that the effective size and success of the collective are highly dependent on properties of the learning algorithm. This highlights the necessity of taking the learning algorithm into account when studying the impact of collective action in machine learning.
