Decentralized Kernel Ridge Regression Based on Data-Dependent Random Feature
Ruikai Yang, Fan He, Mingzhen He, Jie Yang, Xiaolin Huang
TL;DR
This paper tackles decentralized kernel ridge regression under data heterogeneity by introducing data-dependent random features (DDRF), which allow each node to use its own RFs while pursuing consensus on decision functions rather than on feature coefficients. The authors formulate a relaxed, convex objective that couples neighboring nodes via alignment penalties and derive an efficient primal-only scheme that updates node-level coefficients using precomputed local matrices, at the same communication cost as competing methods. They prove convergence under a condition on the self-penalty and validate the approach with experiments on six real-world datasets, showing substantial accuracy gains (e.g., average improvements of 25.5% over data-independent baselines), especially when data distributions differ across nodes and when feature budgets vary. The method, DeKRR-DDRF, offers a flexible, privacy-preserving, communication-efficient framework for decentralized kernel learning that adapts to node-specific data while maintaining network-wide performance gains.
Abstract
Random features (RFs) have been widely used to achieve node consistency in decentralized kernel ridge regression (KRR). Currently, consistency is guaranteed by imposing constraints on the feature coefficients, which requires the random features on different nodes to be identical. However, in many applications the data on different nodes vary significantly in number or distribution, which calls for adaptive, data-dependent methods that generate different RFs on each node. To tackle this essential difficulty, we propose a new decentralized KRR algorithm that pursues consensus on decision functions, which allows great flexibility and adapts well to the data on each node. Convergence is rigorously established and effectiveness is numerically verified: by capturing the characteristics of the data on each node while maintaining the same communication costs as other methods, we achieve an average regression accuracy improvement of 25.5% across six real-world data sets.
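The difference between consensus on coefficients and consensus on decision functions can be illustrated with a minimal sketch. It is not the paper's DeKRR-DDRF formulation: the shared probe set, the squared alignment penalty, the sequential block update, and all names (`make_rff`, `local_update`, `lam`, `rho`) are assumptions made for illustration, and plain random Fourier features stand in for data-dependent ones. Two nodes hold different data and different feature budgets, so their coefficient vectors have different lengths and cannot be constrained to be equal, yet their predictions can still be pulled together.

```python
import numpy as np

def make_rff(dim, num_features, sigma, rng):
    """Random Fourier feature map approximating a Gaussian kernel of width sigma."""
    W = rng.normal(scale=1.0 / sigma, size=(dim, num_features))
    b = rng.uniform(0.0, 2.0 * np.pi, size=num_features)
    return lambda X: np.sqrt(2.0 / num_features) * np.cos(X @ W + b)

def local_update(Z, y, P, target, lam, rho):
    """Minimize ||Z t - y||^2 / n + lam ||t||^2 + rho ||P t - target||^2 / m over t."""
    n, D = Z.shape
    m = P.shape[0]
    A = Z.T @ Z / n + lam * np.eye(D) + rho * P.T @ P / m
    rhs = Z.T @ y / n + rho * P.T @ target / m
    return np.linalg.solve(A, rhs)

rng = np.random.default_rng(0)

def f_true(X):
    return np.sin(3.0 * X[:, 0])

# Heterogeneous nodes: different input ranges, sample sizes, and feature budgets.
X1 = rng.uniform(-1.0, 0.2, size=(80, 1))
X2 = rng.uniform(-0.2, 1.0, size=(120, 1))
y1 = f_true(X1) + 0.1 * rng.normal(size=80)
y2 = f_true(X2) + 0.1 * rng.normal(size=120)
phi1 = make_rff(1, 40, 0.5, rng)   # node 1 uses 40 features
phi2 = make_rff(1, 60, 0.5, rng)   # node 2 uses 60 different features

# Hypothetical shared probe points on which predictions are compared.
probe = np.linspace(-1.0, 1.0, 50)[:, None]
Z1, Z2 = phi1(X1), phi2(X2)
P1, P2 = phi1(probe), phi2(probe)

lam, rho = 1e-3, 1.0

# Start from independent local ridge fits (rho = 0, so the target is unused).
t1 = local_update(Z1, y1, P1, np.zeros(50), lam, 0.0)
t2 = local_update(Z2, y2, P2, np.zeros(50), lam, 0.0)
gap_before = np.mean((P1 @ t1 - P2 @ t2) ** 2)

# Sequential block updates: each node only needs its neighbor's predictions.
for _ in range(50):
    t1 = local_update(Z1, y1, P1, P2 @ t2, lam, rho)
    t2 = local_update(Z2, y2, P2, P1 @ t1, lam, rho)
gap_after = np.mean((P1 @ t1 - P2 @ t2) ** 2)

print("coefficient shapes:", t1.shape, t2.shape)
print("prediction gap before / after:", gap_before, gap_after)
```

In this sketch the nodes exchange only predictions on the probe points, never coefficients or raw data, which is why differing feature maps pose no obstacle. Because each block update exactly minimizes a jointly convex objective, the prediction gap after the loop cannot exceed the gap between the independent fits.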
