Supervised Learning with Evolving Tasks and Performance Guarantees
Verónica Álvarez, Santiago Mazuelas, Jose A. Lozano
TL;DR
This work addresses learning across sequences of evolving classification tasks, unifying batch and online settings (MDA, MTL, SCD, CL) under a single, robust framework. It introduces minimax risk classifiers with uncertainty sets defined by expectations of a feature mapping, and develops forward (Kalman-filter-like) and backward (RTS smoother) recursions to estimate task-mean statistics and their uncertainties across multidimensional task changes. Theoretical results characterize the effective sample size (ESS) gains and furnish computable, tight performance guarantees for the error probability on each task, while extending the approach to higher-order dependencies. Empirical results on diverse benchmarks validate the multidimensional adaptation and the guarantees, showing consistent improvements over state-of-the-art baselines in both batch and online scenarios. The methodology paves the way for principled, scalable transfer across evolving task environments with rigorous performance assurances.
Abstract
Multiple supervised learning scenarios are composed by a sequence of classification tasks. For instance, multi-task learning and continual learning aim to learn a sequence of tasks that is either fixed or grows over time. Existing techniques for learning tasks that are in a sequence are tailored to specific scenarios, lacking adaptability to others. In addition, most of existing techniques consider situations in which the order of the tasks in the sequence is not relevant. However, it is common that tasks in a sequence are evolving in the sense that consecutive tasks often have a higher similarity. This paper presents a learning methodology that is applicable to multiple supervised learning scenarios and adapts to evolving tasks. Differently from existing techniques, we provide computable tight performance guarantees and analytically characterize the increase in the effective sample size. Experiments on benchmark datasets show the performance improvement of the proposed methodology in multiple scenarios and the reliability of the presented performance guarantees.
