MODL: Multilearner Online Deep Learning
Antonios Valkanas, Boris N. Oreshkin, Mark Coates
TL;DR
MODL tackles online learning with streaming data and missing features by proposing a multilearner stacking framework that blends a fast online logistic regression with slower deep components, including a ProtoRes-based set learner. The core idea is to aggregate latent scores from diverse learners via a summation and a final softmax to produce predictions, avoiding hedge backpropagation and achieving faster convergence than prior online deep-learning methods. Empirical results on eight benchmarks show significant improvements in accuracy and training efficiency, with large reductions in per-step computation from $O(nL^2)$ to $O(nL)$ and robust performance across varying feature availability. The work demonstrates practical impact for real-time systems by delivering scalable online deep learning with strong empirical performance and accessible code.
Abstract
Online deep learning tackles the challenge of learning from data streams by balancing two competing goals: fast learning and deep learning. However, existing research primarily emphasizes deep learning solutions, which are more adept at handling the ``deep'' aspect than the ``fast'' aspect of online learning. In this work, we introduce an alternative paradigm through a hybrid multilearner approach. We begin by developing a fast online logistic regression learner, which operates without relying on backpropagation. It leverages closed-form recursive updates of model parameters, efficiently addressing the fast learning component of the online learning challenge. This approach is further integrated with a cascaded multilearner design, where shallow and deep learners are co-trained in a cooperative, synergistic manner to solve the online learning problem. We demonstrate that this approach achieves state-of-the-art performance on standard online learning datasets. We make our code available: https://github.com/AntonValk/MODL
