Peer-to-Peer Deep Learning for Beyond-5G IoT
Srinivasa Pranav, José M. F. Moura
TL;DR
This work tackles the scalability and privacy limitations of server-based federated learning in beyond-5G IoT by introducing P2PL, a fully distributed device-to-device deep learning framework. It combines a max norm synchronization initialization with alternating on-device training and neighbor consensus, using device-specific mixing weights to accommodate nonuniform data and network topology, and demonstrates that 100 devices can achieve test performance on par with federated and centralized training. Key contributions include showing faster convergence with max norm synchronization, relaxing doubly stochastic mixing assumptions, and proving robustness to diverse graphs, intermittent communications, and non-IID data distributions. The approach promises privacy-preserving, scalable DL for large, heterogeneous IoT networks without centralized servers, reducing latency and single points of failure while maintaining performance.
Abstract
We present P2PL, a practical multi-device peer-to-peer deep learning algorithm that, unlike the federated learning paradigm, does not require coordination from edge servers or the cloud. This makes P2PL well-suited for the sheer scale of beyond-5G computing environments like smart cities that otherwise create range, latency, bandwidth, and single point of failure issues for federated approaches. P2PL introduces max norm synchronization to catalyze training, retains on-device deep model training to preserve privacy, and leverages local inter-device communication to implement distributed consensus. Each device iteratively alternates between two phases: 1) on-device learning and 2) peer-to-peer cooperation where they combine model parameters with nearby devices. We empirically show that all participating devices achieve the same test performance attained by federated and centralized training -- even with 100 devices and relaxed singly stochastic consensus weights. We extend these experimental results to settings with diverse network topologies, sparse and intermittent communication, and non-IID data distributions.
