Online Regularized Statistical Learning in Reproducing Kernel Hilbert Space With Non-Stationary Data
Xiwei Zhang, Yan Chen, Tao Li
TL;DR
This work addresses online statistical learning in RKHS under non-stationary data by introducing a random Tikhonov regularization path and recasting the problem as solving a random ill-posed inverse problem with time-varying forward operators $T_k$. It develops a two-term error decomposition (multiplicative noise and regularization-path drift) and proves mean-square convergence of the online output to the unknown function when the regularization path drifts slowly and the RKHS persistence of excitation condition holds, even without data independence. A specific case with independent non-identically distributed data is analyzed, showing that mild drift in input marginals suffices for consistency. Numerical experiments validate convergence and illustrate advantages over KLMS and NORMA in non-stationary environments. The results provide a rigorous framework for tracking time-varying targets in infinite-dimensional RKHS settings with non-iid data streams.
Abstract
We study recursive regularized learning algorithms in the reproducing kernel Hilbert space (RKHS) with non-stationary online data streams. We introduce the concept of random Tikhonov regularization path and decompose the tracking error of the algorithm's output for the regularization path into random difference equations in RKHS. We show that the tracking error vanishes in mean square if the regularization path is slowly time-varying. Then, leveraging the monotonicity of inverse operators and the spectral decomposition of compact operators, and introducing the RKHS persistence of excitation condition, we develop a dominated convergence method to prove the mean square consistency between the regularization path and the unknown function to be learned. Especially, for independent and non-identically distributed data streams, the mean square consistency between the algorithm's output and the unknown function is achieved if the input data's marginal probability measures are slowly time-varying and the average measure over each fixed-length time period has a uniformly strictly positive lower bound.
