Covert Attacks on Machine Learning Training in Passively Secure MPC
Matthew Jagielski, Daniel Escudero, Rahul Rachuri, Peter Scholl
TL;DR
The paper shows that active adversaries can stealthily compromise ML models trained with passively secure MPC, challenging the assumption that passive security suffices. It develops low-level MPC attacks targeting activations and comparisons, and combines them with high-level gradient-based manipulation strategies (gradient zeroing, shifting, and scaling) to realize data poisoning and privacy breaches. Through plaintext simulations on standard datasets, the authors demonstrate dramatic effects including backdoors, targeted misclassification, and dramatic increases in membership inference success, often with minimal degradation to overall model accuracy. The work highlights the need for actively secure MPC in PPML and discusses practical mitigations and future research directions at the intersection of MPC and adversarial ML.
Abstract
Secure multiparty computation (MPC) allows data owners to train machine learning models on combined data while keeping the underlying training data private. The MPC threat model either considers an adversary who passively corrupts some parties without affecting their overall behavior, or an adversary who actively modifies the behavior of corrupt parties. It has been argued that in some settings, active security is not a major concern, partly because of the potential risk of reputation loss if a party is detected cheating. In this work we show explicit, simple, and effective attacks that an active adversary can run on existing passively secure MPC training protocols, while keeping essentially zero risk of the attack being detected. The attacks we show can compromise both the integrity and privacy of the model, including attacks reconstructing exact training data. Our results challenge the belief that a threat model that does not include malicious behavior by the involved parties may be reasonable in the context of PPML, motivating the use of actively secure protocols for training.
