Adaptive prediction theory combining offline and online learning
Haizheng Li, Lei Guo
TL;DR
The paper develops a rigorous two-stage framework that fuses offline nonlinear-least-squares learning with online meta-LMS adaptation to handle distribution shifts and parameter drift in nonlinear stochastic dynamical systems. It provides a Generalization Lemma giving KL-divergence–based bounds on offline generalization, and introduces a meta-LMS online method to track drifting parameters, with theoretical guarantees showing near-optimal prediction performance under typical conditions. Corollaries cover common time-series and control settings, and the results are complemented by proofs and numerical simulations illustrating improved transient behavior over purely offline or online methods. This work offers a principled foundation for reliable, scalable offline-online learning in safety-critical dynamic environments.
Abstract
Real-world intelligence systems usually operate by combining offline learning and online adaptation with highly correlated and non-stationary system data or signals, which, however, has rarely been investigated theoretically in the literature. This paper initiates a theoretical investigation on the prediction performance of a two-stage learning framework combining offline and online algorithms for a class of nonlinear stochastic dynamical systems. For the offline-learning phase, we establish an upper bound on the generalization error for approximate nonlinear-least-squares estimation under general datasets with strong correlation and distribution shift, leveraging the Kullback-Leibler divergence to quantify the distributional discrepancies. For the online-adaptation phase, we address, on the basis of the offline-trained model, the possible uncertain parameter drift in real-world target systems by proposing a meta-LMS prediction algorithm. This two-stage framework, integrating offline learning with online adaptation, demonstrates superior prediction performances compared with either purely offline or online methods. Both theoretical guarantees and empirical studies are provided.
