Table of Contents
Fetching ...

Algorithmic Collective Action in Machine Learning

Moritz Hardt, Eric Mazumdar, Celestine Mendler-Dünner, Tijana Zrnic

TL;DR

The paper develops a principled model of algorithmic collective action on ML platforms, where a small fraction $\alpha$ of participants coordinates data edits to steer the firm’s learning on the mixed distribution $\mathcal{P}=\alpha\mathcal{P}^*+(1-\alpha)\mathcal{P}_0$. It analyzes three learning contexts—classification, convex risk minimization, and gradient-based learning—proposing strategies such as signal planting, signal erasing, gradient neutralization, and gradient control, and derives quantitative thresholds for the collective size needed to achieve targets. The authors validate the theory with an extensive resume classification experiment using a BERT-like model, showing that collectives with vanishing $\alpha$ can command substantial influence, with the threshold depending on signal uniqueness, suboptimality, and training dynamics. The results illuminate practical risks and policy implications for ML powered platforms, revealing how high dimensionality and weak signals can amplify the leverage of small collectives while also guiding defenses against data manipulation attacks.

Abstract

We initiate a principled study of algorithmic collective action on digital platforms that deploy machine learning algorithms. We propose a simple theoretical model of a collective interacting with a firm's learning algorithm. The collective pools the data of participating individuals and executes an algorithmic strategy by instructing participants how to modify their own data to achieve a collective goal. We investigate the consequences of this model in three fundamental learning-theoretic settings: the case of a nonparametric optimal learning algorithm, a parametric risk minimizer, and gradient-based optimization. In each setting, we come up with coordinated algorithmic strategies and characterize natural success criteria as a function of the collective's size. Complementing our theory, we conduct systematic experiments on a skill classification task involving tens of thousands of resumes from a gig platform for freelancers. Through more than two thousand model training runs of a BERT-like language model, we see a striking correspondence emerge between our empirical observations and the predictions made by our theory. Taken together, our theory and experiments broadly support the conclusion that algorithmic collectives of exceedingly small fractional size can exert significant control over a platform's learning algorithm.

Algorithmic Collective Action in Machine Learning

TL;DR

The paper develops a principled model of algorithmic collective action on ML platforms, where a small fraction of participants coordinates data edits to steer the firm’s learning on the mixed distribution . It analyzes three learning contexts—classification, convex risk minimization, and gradient-based learning—proposing strategies such as signal planting, signal erasing, gradient neutralization, and gradient control, and derives quantitative thresholds for the collective size needed to achieve targets. The authors validate the theory with an extensive resume classification experiment using a BERT-like model, showing that collectives with vanishing can command substantial influence, with the threshold depending on signal uniqueness, suboptimality, and training dynamics. The results illuminate practical risks and policy implications for ML powered platforms, revealing how high dimensionality and weak signals can amplify the leverage of small collectives while also guiding defenses against data manipulation attacks.

Abstract

We initiate a principled study of algorithmic collective action on digital platforms that deploy machine learning algorithms. We propose a simple theoretical model of a collective interacting with a firm's learning algorithm. The collective pools the data of participating individuals and executes an algorithmic strategy by instructing participants how to modify their own data to achieve a collective goal. We investigate the consequences of this model in three fundamental learning-theoretic settings: the case of a nonparametric optimal learning algorithm, a parametric risk minimizer, and gradient-based optimization. In each setting, we come up with coordinated algorithmic strategies and characterize natural success criteria as a function of the collective's size. Complementing our theory, we conduct systematic experiments on a skill classification task involving tens of thousands of resumes from a gig platform for freelancers. Through more than two thousand model training runs of a BERT-like language model, we see a striking correspondence emerge between our empirical observations and the predictions made by our theory. Taken together, our theory and experiments broadly support the conclusion that algorithmic collectives of exceedingly small fractional size can exert significant control over a platform's learning algorithm.
Paper Structure (44 sections, 11 theorems, 54 equations, 7 figures)

This paper contains 44 sections, 11 theorems, 54 equations, 7 figures.

Key Result

Theorem 1

Consider the feature--label signal strategy and suppose that the signal is $\xi$-unique. Then, the success against a classifier that is $\epsilon$-suboptimal on the signal set $\mathcal{X}^*$ is lower bounded by

Figures (7)

  • Figure 1: Illustration of the success rate predicted by Theorem \ref{['thm:trigger-label']}. In the first we fix $\epsilon=0$ and vary $\xi$, and in the second we fix $\xi$ and vary the classifier's suboptimality, $\epsilon$. We upper bound the suboptimality gap as $\Delta\leq 1$.
  • Figure 2: Success rate of Strategy 1 as the collective size varies. Each dot represents one model training run. The solid line is a best-fit sigmoid function with two shape parameters and one offset term.
  • Figure 3: Success rate of Strategy 2 as the collective size varies. Each dot represents one model training run. The solid line is a best-fit sigmoid function with two shape parameters and one offset term.
  • Figure 4: Random labels increase success of Strategy 2.
  • Figure 5: Additional epochs of training increase the success rate.
  • ...and 2 more figures

Theorems & Definitions (22)

  • Definition 1: Critical mass
  • Definition 2: $\epsilon$-suboptimal classifier
  • Definition 3: $\xi$-unique signal
  • Theorem 1
  • Corollary 2
  • Theorem 3
  • Corollary 4
  • Theorem 5
  • Corollary 6
  • Definition 4: Risk minimizer
  • ...and 12 more