Cross-Learning from Scarce Data via Multi-Task Constrained Optimization
Leopoldo Agorio, Juan Cerviño, Miguel Calvo-Fullana, Alejandro Ribeiro, Juan Andrés Bazerque
TL;DR
The paper targets data-scarce supervised learning by proposing cross-learning, a constrained multi-task framework that jointly estimates task-specific parameters while enforcing controlled similarity across tasks under a deterministic-parameter regime. It introduces both parametric-constraint and coupled-outputs formulations, and provides theoretical guarantees (under Gaussian data) that there exists a centrality level $\epsilon$ yielding lower mean-squared error than fully separate or fully shared models. To solve these problems, the authors develop ADMM-based and primal-dual algorithms, and validate the approach on real data: COVID-19 SIR model fitting across countries and Office-Home image classification, where cross-learning improves peak prediction accuracy and classification performance relative to baselines. The work demonstrates that sharing information across related tasks can reduce data requirements and improve reliability, with broad applicability to domains like epidemiology and computer vision, while allowing task-specificity to be preserved through tunable similarity constraints.
Abstract
A learning task, understood as the problem of fitting a parametric model from supervised data, fundamentally requires the dataset to be large enough to be representative of the underlying distribution of the source. When data is limited, the learned models fail generalize to cases not seen during training. This paper introduces a multi-task \emph{cross-learning} framework to overcome data scarcity by jointly estimating \emph{deterministic} parameters across multiple, related tasks. We formulate this joint estimation as a constrained optimization problem, where the constraints dictate the resulting similarity between the parameters of the different models, allowing the estimated parameters to differ across tasks while still combining information from multiple data sources. This framework enables knowledge transfer from tasks with abundant data to those with scarce data, leading to more accurate and reliable parameter estimates, providing a solution for scenarios where parameter inference from limited data is critical. We provide theoretical guarantees in a controlled framework with Gaussian data, and show the efficiency of our cross-learning method in applications with real data including image classification and propagation of infectious diseases.
