Distributed Networked Multi-task Learning
Lingzhou Hong, Alfredo Garcia
TL;DR
This work develops a Distributed and Asynchronous algorithm for Multi-task Learning (DAMTL) that operates over a directed network partitioned into groups. By formulating a bi-level optimization with inner task-model estimation $\mathbf{W}$ and outer task-relationship precision $\boldsymbol{\Theta}$, the authors establish a two-timescale, asynchronous SGD framework with messengers to propagate cross-group information while preserving local computation; continuous-time approximations yield finite-time convergence guarantees for both inner and outer problems. The approach accounts for heterogeneous and correlated data streams and provides explicit conditions and step-size choices to ensure stability and bounded error in both the parameter estimates and the learned task-relationship matrix. Numerical experiments on synthetic Gaussian MRF temperature fields and real student-performance data demonstrate faster convergence and robustness of DAMTL compared to baseline approaches, illustrating its scalability and applicability to distributed, privacy-preserving learning scenarios.
Abstract
We consider a distributed multi-task learning scheme that accounts for multiple linear model estimation tasks with heterogeneous and/or correlated data streams. We assume that nodes can be partitioned into groups corresponding to different learning tasks and communicate according to a directed network topology. Each node estimates a linear model asynchronously and is subject to local (within-group) regularization and global (across groups) regularization terms targeting noise reduction and generalization performance improvement respectively. We provide a finite-time characterization of convergence of the estimators and task relation and illustrate the scheme's general applicability in two examples: random field temperature estimation and modeling student performance from different academic districts.
