Provably Robust Federated Reinforcement Learning
Minghong Fang, Xilong Wang, Neil Zhenqiang Gong
TL;DR
The paper tackles poisoning in federated reinforcement learning by introducing a Normalized attack that maximizes angular deviation between pre- and post-attack aggregated updates, challenging existing Byzantine-robust defenses. It then proposes an ensemble FRL defense that trains multiple global policies over disjoint agent groups and combines their test-time actions via majority voting for discrete actions or the geometric median for continuous actions, with formal guarantees under certain threat thresholds. Empirical results across Cart Pole, Lunar Lander, and Inverted Pendulum show the Normalized attack can significantly disrupt non-ensemble, robust FRL rules, while the ensemble approach preserves near non-attack performance and provides strong resilience against both existing and the new attack. The work advances FRL security by linking a novel attack to a practical, provably robust ensemble defense, with implications for safer multi-agent learning in privacy-preserving, distributed environments.
Abstract
Federated reinforcement learning (FRL) allows agents to jointly learn a global decision-making policy under the guidance of a central server. While FRL has advantages, its decentralized design makes it prone to poisoning attacks. To mitigate this, Byzantine-robust aggregation techniques tailored for FRL have been introduced. Yet, in our work, we reveal that these current Byzantine-robust techniques are not immune to our newly introduced Normalized attack. Distinct from previous attacks that targeted enlarging the distance of policy updates before and after an attack, our Normalized attack emphasizes on maximizing the angle of deviation between these updates. To counter these threats, we develop an ensemble FRL approach that is provably secure against both known and our newly proposed attacks. Our ensemble method involves training multiple global policies, where each is learnt by a group of agents using any foundational aggregation rule. These well-trained global policies then individually predict the action for a specific test state. The ultimate action is chosen based on a majority vote for discrete action systems or the geometric median for continuous ones. Our experimental results across different settings show that the Normalized attack can greatly disrupt non-ensemble Byzantine-robust methods, and our ensemble approach offers substantial resistance against poisoning attacks.
