Distributed Experimental Design Networks
Yuanyuan Li, Lili Su, Carlee Joe-Wong, Edmund Yeh, Stratis Ioannidis
TL;DR
The paper addresses rate-allocation for data streams in distributed experimental design networks with multicast constraints, proving the learning-quality objective is non-convex yet continuous DR-submodular. It gives a centralized FW-based algorithm with a $1-1/e$ guarantee and a scalable distributed primal-dual FW variant that preserves this guarantee under a differentiable constraint relaxation. The work extends to Gaussian sources and heterogeneous noise, providing unbiased gradient estimators with probabilistic error bounds and extensive simulations on synthetic and backbone topologies showing near-centralized performance. The results demonstrate that multicast-based design substantially improves throughput and model quality, offering practical avenues for edge-enabled distributed learning. Limitations include synchronization requirements, motivating future work on asynchronous schemes and shadow-price-based gradient estimation, along with publicly available code and data.
Abstract
As edge computing capabilities increase, model learning deployments in diverse edge environments have emerged. In experimental design networks, introduced recently, network routing and rate allocation are designed to aid the transfer of data from sensors to heterogeneous learners. We design efficient experimental design network algorithms that are (a) distributed and (b) use multicast transmissions. This setting poses significant challenges as classic decentralization approaches often operate on (strictly) concave objectives under differentiable constraints. In contrast, the problem we study here has a non-convex, continuous DR-submodular objective, while multicast transmissions naturally result in non-differentiable constraints. From a technical standpoint, we propose a distributed Frank-Wolfe and a distributed projected gradient ascent algorithm that, coupled with a relaxation of non-differentiable constraints, yield allocations within a $1-1/e$ factor from the optimal. Numerical evaluations show that our proposed algorithms outperform competitors with respect to model learning quality.
