Algorithm Design for Continual Learning in IoT Networks
Shugang Hao, Lingjie Duan
TL;DR
This work tackles continual learning in IoT networks where a mobile agent can route data collection to alter the training task sequence. It derives closed-form expressions for forgetting loss under both underparameterized and overparameterized regimes, proves the problem is NP-hard via a shortest Hamiltonian path reduction, and proposes a polynomial-time approximation achieving ratios $\frac{3}{2}$ (underparameterized) and $\frac{3}{2} + r^{1-T}$ (overparameterized) with $r = 1 - \frac{n}{m}$. The authors design a two-layer algorithm that first selects a task order and then learns region predictors, with a Christofides-based SHP step to guarantee performance bounds, and they validate the approach through simulations showing close-to-optimal performance and substantial improvements over baselines. The results enable practical deployment of CL in dynamic IoT scenarios by balancing forgetting across tasks with travel costs, offering provable guarantees and insights into regime-dependent behavior as the feature-to-sample ratio changes. The work also highlights how problem structure—route planning combined with model updating—drives new approximation techniques in continual learning contexts.
Abstract
Continual learning (CL) is a new online learning technique over sequentially generated streaming data from different tasks, aiming to maintain a small forgetting loss on previously-learned tasks. Existing work focuses on reducing the forgetting loss under a given task sequence. However, if similar tasks continuously appear to the end time, the forgetting loss is still huge on prior distinct tasks. In practical IoT networks, an autonomous vehicle to sample data and learn different tasks can route and alter the order of task pattern at increased travelling cost. To our best knowledge, we are the first to study how to opportunistically route the testing object and alter the task sequence in CL. We formulate a new optimization problem and prove it NP-hard. We propose a polynomial-time algorithm to achieve approximation ratios of $\frac{3}{2}$ for underparameterized case and $\frac{3}{2} + r^{1-T}$ for overparameterized case, respectively, where $r:=1-\frac{n}{m}$ is a parameter of feature number $m$ and sample number $n$ and $T$ is the task number. Simulation results verify our algorithm's close-to-optimum performance.
