High-Dimensional Distributed Sparse Classification with Scalable Communication-Efficient Global Updates
Fred Lu, Ryan R. Curtin, Edward Raff, Francis Ferraro, James Holt
TL;DR
This work addresses the challenge of scalable distributed training for high-dimensional sparse logistic regression by developing proxCSL, a proximal Newton-based solver for the communication-efficient surrogate likelihood framework. By incorporating adaptive proximal regularization (adaptive $\alpha$) and a Hessian-free proximal Newton approach with Hessian caching, proxCSL stabilizes updates, preserves sparsity, and achieves fast convergence under realistic high-dimensional regimes. Theoretical guarantees rely on restricted strong convexity and restricted Lipschitz Hessian, showing convergence and error bounds, while empirical results demonstrate superior accuracy and competitive runtimes across single-node and multi-node settings, often matching full-data solutions after just two updates. The method significantly advances practical distributed optimization for sparse models and offers a scalable path for learning in datasets with millions of features.
Abstract
As the size of datasets used in statistical learning continues to grow, distributed training of models has attracted increasing attention. These methods partition the data and exploit parallelism to reduce memory and runtime, but suffer increasingly from communication costs as the data size or the number of iterations grows. Recent work on linear models has shown that a surrogate likelihood can be optimized locally to iteratively improve on an initial solution in a communication-efficient manner. However, existing versions of these methods experience multiple shortcomings as the data size becomes massive, including diverging updates and efficiently handling sparsity. In this work we develop solutions to these problems which enable us to learn a communication-efficient distributed logistic regression model even beyond millions of features. In our experiments we demonstrate a large improvement in accuracy over distributed algorithms with only a few distributed update steps needed, and similar or faster runtimes. Our code is available at \url{https://github.com/FutureComputing4AI/ProxCSL}.
