On the Detectability of Active Gradient Inversion Attacks in Federated Learning

Vincenzo Carletti; Pasquale Foggia; Carlo Mazzocca; Giuseppe Parrella; Mario Vento

On the Detectability of Active Gradient Inversion Attacks in Federated Learning

Vincenzo Carletti, Pasquale Foggia, Carlo Mazzocca, Giuseppe Parrella, Mario Vento

TL;DR

The paper tackles privacy risks in Federated Learning by analyzing active gradient inversion attacks (GIAs) and their detectability. It provides a comprehensive assessment of four state-of-the-art active GIAs (two handcrafted and two learned) and introduces lightweight, client-side detectors based on weight-space anomalies and loss/gradient dynamics. Through extensive experiments on multiple datasets and architectures, the detectors demonstrate strong performance and robustness, even under partial participation, without requiring changes to the FL protocol. The work highlights practical defense implications for FL deployments and emphasizes the need to consider behavioral side-channels alongside gradient-space signals in adversarial settings.

Abstract

One of the key advantages of Federated Learning (FL) is its ability to collaboratively train a Machine Learning (ML) model while keeping clients' data on-site. However, this can create a false sense of security. Despite not sharing private data increases the overall privacy, prior studies have shown that gradients exchanged during the FL training remain vulnerable to Gradient Inversion Attacks (GIAs). These attacks allow reconstructing the clients' local data, breaking the privacy promise of FL. GIAs can be launched by either a passive or an active server. In the latter case, a malicious server manipulates the global model to facilitate data reconstruction. While effective, earlier attacks falling under this category have been demonstrated to be detectable by clients, limiting their real-world applicability. Recently, novel active GIAs have emerged, claiming to be far stealthier than previous approaches. This work provides the first comprehensive analysis of these claims, investigating four state-of-the-art GIAs. We propose novel lightweight client-side detection techniques, based on statistically improbable weight structures and anomalous loss and gradient dynamics. Extensive evaluation across several configurations demonstrates that our methods enable clients to effectively detect active GIAs without any modifications to the FL training protocol.

On the Detectability of Active Gradient Inversion Attacks in Federated Learning

TL;DR

Abstract

On the Detectability of Active Gradient Inversion Attacks in Federated Learning

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (8)