RAIFLE: Reconstruction Attacks on Interaction-based Federated Learning with Adversarial Data Manipulation
Dzung Pham, Shreyas Kulkarni, Amir Houmansadr
TL;DR
This work reveals a privacy vulnerability in interaction-based federated learning (IFL) where a central server with control over item features can actively manipulate data through Adversarial Data Manipulation (ADM) to significantly improve reconstruction of users' private interactions. The authors introduce RAIFLE, an optimization-based attack framework that outperforms gradient inversion across federated RS and OLTR settings, including image-based modalities, by jointly reconstructing interactions and, when possible, user embeddings; ADM techniques include fingerprinting and noise injection, with indirect manipulation for cases of limited feature control. They evaluate RAIFLE on RS (MovieLens-100K, Steam-200K) and OLTR (LETOR/MSLR, ImageNet-based) datasets, demonstrating strong reconstruction performance (often AUC near 0.9–1.0) and showing that standard defenses like Local DP and Secure Aggregation can be overcome under certain conditions, though at utility costs. The paper discusses countermeasures, utility implications, and practical considerations, highlighting a significant privacy risk in IFL and outlining directions for secure, private designs in RS/OLTR and broader interactive learning scenarios.
Abstract
Federated learning has emerged as a promising privacy-preserving solution for machine learning domains that rely on user interactions, particularly recommender systems and online learning to rank. While there has been substantial research on the privacy of traditional federated learning, little attention has been paid to the privacy properties of these interaction-based settings. In this work, we show that users face an elevated risk of having their private interactions reconstructed by the central server when the server can control the training features of the items that users interact with. We introduce RAIFLE, a novel optimization-based attack framework where the server actively manipulates the features of the items presented to users to increase the success rate of reconstruction. Our experiments with federated recommendation and online learning-to-rank scenarios demonstrate that RAIFLE is significantly more powerful than existing reconstruction attacks like gradient inversion, achieving high performance consistently in most settings. We discuss the pros and cons of several possible countermeasures to defend against RAIFLE in the context of interaction-based federated learning. Our code is open-sourced at https://github.com/dzungvpham/raifle.
