Progressive Supervision via Label Decomposition: An Long-Term and Large-Scale Wireless Traffic Forecasting Method
Daojun Liang, Haixia Zhang, Dongfeng Yuan
TL;DR
This work tackles the challenge of long-term and large-scale wireless traffic forecasting (LL-WTF) on city-scale graphs by introducing Random Subgraph Sampling (RSS) to enable scalable training and Progressive Supervision via Label Decomposition (PSLD) to address non-stationarity. PSLD decomposes the label into multiple easier components (via Mean-Variance Decomposition or STL), which are learned progressively at shallow layers and fused at deeper layers, using a decomposer-learner-predictor-combiner architecture with a joint loss. Empirical results on three large WT datasets show PSLD achieves state-of-the-art performance (average improvements around 2%, 4%, and 11% over baselines on the three datasets) while maintaining efficient inference; an open-source WT forecasting library WTFlib is released to facilitate replication and benchmarking. Overall, RSS and PSLD provide a scalable, robust, and interpretable framework for LL-WTF and are applicable as versatile enhancements to other forecasting models, with future work aiming to integrate more advanced nonlinear backbones.
Abstract
Long-term and Large-scale Wireless Traffic Forecasting (LL-WTF) is pivotal for strategic network management and comprehensive planning on a macro scale. However, LL-WTF poses greater challenges than short-term ones due to the pronounced non-stationarity of extended wireless traffic and the vast number of nodes distributed at the city scale. To cope with this, we propose a Progressive Supervision method based on Label Decomposition (PSLD). Specifically, we first introduce a Random Subgraph Sampling (RSS) algorithm designed to sample a tractable subset from large-scale traffic data, thereby enabling efficient network training. Then, PSLD employs label decomposition to obtain multiple easy-to-learn components, which are learned progressively at shallow layers and combined at deep layers to effectively cope with the non-stationary problem raised by LL-WTF tasks. Finally, we compare the proposed method with various state-of-the-art (SOTA) methods on three large-scale WT datasets. Extensive experimental results demonstrate that the proposed PSLD significantly outperforms existing methods, with an average 2%, 4%, and 11% performance improvement on three WT datasets, respectively. In addition, we built an open source library for WT forecasting (WTFlib) to facilitate related research, which contains numerous SOTA methods and provides a strong benchmark.Experiments can be reproduced through https://github.com/Anoise/WTFlib.
