Efficient Large-Scale Learning of Minimax Risk Classifiers
Kartheek Bondugula, Santiago Mazuelas, Aritz Pérez
TL;DR
The paper tackles learning minimax risk classifiers (MRCs) for large-scale, multi-class tasks where the objective is the worst-case risk over an uncertainty set. It introduces a constraint-and-column generation algorithm (MRC-CCG) that solves the LP formulation of MRCs by iteratively updating small subsets of constraints and features, with greedy primal/dual updates and efficient constraint evaluation. Theoretical results establish monotone convergence and bounds on the solution quality under violation thresholds, while empirical results show up to 10x speedups on large-scale data and up to 100x speedups as the number of classes grows, without compromising accuracy. This approach enables robust, scalable MRC learning for real-world, high-dimensional, multi-class problems, offering practical performance guarantees and significant computational benefits.
Abstract
Supervised learning with large-scale data usually leads to complex optimization problems, especially for classification tasks with multiple classes. Stochastic subgradient methods can enable efficient learning with a large number of samples for classification techniques that minimize the average loss over the training samples. However, recent techniques, such as minimax risk classifiers (MRCs), minimize the maximum expected loss and are not amenable to stochastic subgradient methods. In this paper, we present a learning algorithm based on the combination of constraint and column generation that enables efficient learning of MRCs with large-scale data for classification tasks with multiple classes. Experiments on multiple benchmark datasets show that the proposed algorithm provides upto a 10x speedup for general large-scale data and around a 100x speedup with a sizeable number of classes.
