Attribute Inference Attacks for Federated Regression Tasks
Francesco Diana, Othmane Marfoq, Chuan Xu, Giovanni Neglia, Frédéric Giroire, Eoin Thomas
TL;DR
The paper addresses privacy leakage in federated regression by proposing a model-based AIA that first reconstructs a targeted client’s optimal local model and then applies a model-based attribute inference on that model. It provides a theoretical lower bound for AIA accuracy in least squares regression and demonstrates that model-based attacks can surpass gradient-based approaches, especially under data heterogeneity and active adversaries. Experiments on Medical and Income datasets show substantial improvements over state-of-the-art gradient-based AIAs, with DP-SGD offering incomplete protection. The work highlights practical privacy risks in FL regression and motivates the development of stronger defenses that go beyond standard differential privacy alone.
Abstract
Federated Learning (FL) enables multiple clients, such as mobile phones and IoT devices, to collaboratively train a global machine learning model while keeping their data localized. However, recent studies have revealed that the training phase of FL is vulnerable to reconstruction attacks, such as attribute inference attacks (AIA), where adversaries exploit exchanged messages and auxiliary public information to uncover sensitive attributes of targeted clients. While these attacks have been extensively studied in the context of classification tasks, their impact on regression tasks remains largely unexplored. In this paper, we address this gap by proposing novel model-based AIAs specifically designed for regression tasks in FL environments. Our approach considers scenarios where adversaries can either eavesdrop on exchanged messages or directly interfere with the training process. We benchmark our proposed attacks against state-of-the-art methods using real-world datasets. The results demonstrate a significant increase in reconstruction accuracy, particularly in heterogeneous client datasets, a common scenario in FL. The efficacy of our model-based AIAs makes them better candidates for empirically quantifying privacy leakage for federated regression tasks.
