Gradient Inversion in Federated Reinforcement Learning

Shenghong He

Gradient Inversion in Federated Reinforcement Learning

Shenghong He

TL;DR

The paper investigates privacy risks in Federated Reinforcement Learning by introducing RGIA, a gradient inversion attack augmented with prior-based regularizations on states, rewards, and transition dynamics to enforce environment plausibility. RGIA reduces the pseudo-solution problem and narrows the feasible reconstruction space, with theoretical guarantees and empirical validation across diverse control and driving tasks showing accurate recovery of local training data from shared gradients. The work demonstrates that naive gradient-based attacks can be mitigated by priors, but defenses such as differential privacy introduce trade-offs between privacy and policy performance, while HE and gradient quantization offer limited protection. Overall, RGIA exposes FRL privacy vulnerabilities, provides a framework for evaluating defenses, and motivates robust privacy-preserving mechanisms in distributed RL systems.

Abstract

Federated reinforcement learning (FRL) enables distributed learning of optimal policies while preserving local data privacy through gradient sharing.However, FRL faces the risk of data privacy leaks, where attackers exploit shared gradients to reconstruct local training data.Compared to traditional supervised federated learning, successful reconstruction in FRL requires the generated data not only to match the shared gradients but also to align with real transition dynamics of the environment (i.e., aligning with the real data transition distribution).To address this issue, we propose a novel attack method called Regularization Gradient Inversion Attack (RGIA), which enforces prior-knowledge-based regularization on states, rewards, and transition dynamics during the optimization process to ensure that the reconstructed data remain close to the true transition distribution.Theoretically, we prove that the prior-knowledge-based regularization term narrows the solution space from a broad set containing spurious solutions to a constrained subset that satisfies both gradient matching and true transition dynamics.Extensive experiments on control tasks and autonomous driving tasks demonstrate that RGIA can effectively constrain reconstructed data transition distributions and thus successfully reconstruct local private data.

Gradient Inversion in Federated Reinforcement Learning

TL;DR

Abstract

Gradient Inversion in Federated Reinforcement Learning

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (9)

Theorems & Definitions (11)