Random Walk Learning and the Pac-Man Attack
Xingran Chen, Parimal Parag, Rohit Bhagat, Zonghong Liu, Salim El Rouayheb
TL;DR
This work tackles the vulnerability of random-walk based decentralized learning to a stealthy Pac-Man attack that terminates incoming random walks with probability $\zeta$. It proposes the Average Crossing (AC) algorithm to locally duplicate RWs, ensuring a bounded RW population and enabling continued learning via RW-SGD. Theoretical results show almost-sure boundedness of RW population, a phase transition in extinction probability tied to the duplication threshold, and convergence of the chain of RWs to a surrogate optimization problem with bounded bias from the true optimum. Empirical results on synthetic and real datasets—including MNIST—validate the theory, reveal a soft phase transition, and demonstrate competitive convergence and robustness against Pac-Man, highlighting the practical viability of fully decentralized defenses in RW-based learning systems.
Abstract
Random walk (RW)-based algorithms have long been popular in distributed systems due to low overheads and scalability, with recent growing applications in decentralized learning. However, their reliance on local interactions makes them inherently vulnerable to malicious behavior. In this work, we investigate an adversarial threat that we term the ``Pac-Man'' attack, in which a malicious node probabilistically terminates any RW that visits it. This stealthy behavior gradually eliminates active RWs from the network, effectively halting the learning process without triggering failure alarms. To counter this threat, we propose the Average Crossing (AC) algorithm--a fully decentralized mechanism for duplicating RWs to prevent RW extinction in the presence of Pac-Man. Our theoretical analysis establishes that (i) the RW population remains almost surely bounded under AC and (ii) RW-based stochastic gradient descent remains convergent under AC, even in the presence of Pac-Man, with a quantifiable deviation from the true optimum. Our extensive empirical results on both synthetic and real-world datasets corroborate our theoretical findings. Furthermore, they uncover a phase transition in the extinction probability as a function of the duplication threshold. We offer theoretical insights by analyzing a simplified variant of the AC, which sheds light on the observed phase transition.
