The Entrapment Problem in Random Walk Decentralized Learning
Zonghong Liu, Salim El Rouayheb, Matthew Dwyer
TL;DR
The paper addresses decentralized learning on graphs with heterogeneous data, where naive importance sampling via Metropolis-Hastings (MH) can entrap the random walk in high-importance nodes, slowing convergence. It proposes Metropolis-Hastings with Lévy jumps (MHLJ) to perturb the MH transitions and enable escape from entrapment, and provides a convergence-rate bound that separates the exploration effect (via mixing time) from the jump-induced bias (an error gap). Theoretical results are complemented by simulations on ring, grid, and Watts-Strogatz networks, showing that MHLJ speeds up convergence in sparse, heterogeneous settings and that the error gap can be controlled by adjusting the jump probability $p_J$. Overall, the work offers a principled approach to combine localized decentralized sampling with stochastic optimization while addressing exploration-exploitation trade-offs in networked data settings.
Abstract
This paper explores decentralized learning in a graph-based setting, where data is distributed across nodes. We investigate a decentralized SGD algorithm that utilizes a random walk to update a global model based on local data. Our focus is on designing the transition probability matrix to speed up convergence. While importance sampling can enhance centralized learning, its decentralized counterpart, using the Metropolis-Hastings (MH) algorithm, can lead to the entrapment problem, where the random walk becomes stuck at certain nodes, slowing convergence. To address this, we propose the Metropolis-Hastings with Lévy Jumps (MHLJ) algorithm, which incorporates random perturbations (jumps) to overcome entrapment. We theoretically establish the convergence rate and error gap of MHLJ and validate our findings through numerical experiments.
