A Practical and Secure Byzantine Robust Aggregator
De Zhang Lee, Aashish Kolluri, Prateek Saxena, Ee-Chien Chang
TL;DR
This work tackles data-poisoning threats in ML by proposing RandEigen, a Byzantine robust aggregator that achieves a near-optimal, dimension-independent bias with quasi-linear runtime. It introduces two main innovations: a fast randomized method to approximate the dominant eigenvector and a convergence-based stopping rule that replaces fixed thresholds, combined with Johnson-Lindenstrauss dimensionality reduction to reduce computation. Theoretical results establish information-theoretic bias guarantees under a stability model, and extensive experiments across federated and centralized settings show RandEigen effectively mitigates a wide range of attacks, including the adaptive HiDRA attack, with substantial speedups and minimal impact on accuracy when no attack is present. These findings indicate RandEigen is practical for real-world neural network training and can serve as a robust defense against data poisoning in high-dimensional gradient spaces.
Abstract
In machine learning security, one is often faced with the problem of removing outliers from a given set of high-dimensional vectors when computing their average. For example, many variants of data poisoning attacks produce gradient vectors during training that are outliers in the distribution of clean gradients, which bias the computed average used to derive the ML model. Filtering them out before averaging serves as a generic defense strategy. Byzantine robust aggregation is an algorithmic primitive which computes a robust average of vectors, in the presence of an $ε$ fraction of vectors which may have been arbitrarily and adaptively corrupted, such that the resulting bias in the final average is provably bounded. In this paper, we give the first robust aggregator that runs in quasi-linear time in the size of input vectors and provably has near-optimal bias bounds. Our algorithm also does not assume any knowledge of the distribution of clean vectors, nor does it require pre-computing any filtering thresholds from it. This makes it practical to use directly in standard neural network training procedures. We empirically confirm its expected runtime efficiency and its effectiveness in nullifying 10 different ML poisoning attacks.
