Federated Smoothing Proximal Gradient for Quantile Regression with Non-Convex Penalties

Reza Mirzaeifard; Diyako Ghaderyan; Stefan Werner

Federated Smoothing Proximal Gradient for Quantile Regression with Non-Convex Penalties

Reza Mirzaeifard, Diyako Ghaderyan, Stefan Werner

TL;DR

This work tackles federated quantile regression on decentralized IoT data under privacy constraints, where the objective combines non-convex penalties (MCP/SCAD) with a non-smooth check loss. It introduces the Federated Smoothing Proximal Gradient (FSPG) method, which replaces non-smooth components with smooth surrogates and couples local gradient steps with a central proximal update, guided by a time-varying penalty and smoothing parameter that shrink over iterations. The authors prove convergence to a stationary point, establishing descent, subgradient bounds, and rates such as ||w^{(k+1)}-w^{(k)}||_2^2 = o(k^{-1-d}) and ||kappa^{(k+1)}||_2 = o(k^{-1/2+d/2}); they also demonstrate that the smoothing parameter mu -> 0 yields convergence of tilde g gradients to the original subgradients. Empirical results across synthetic and real datasets show that FSPG achieves faster convergence and more accurate sparse recovery than competing federated methods, with robust performance under varying sparsity and data distributions, highlighting its practical impact for reliable, privacy-preserving distributed learning.

Abstract

Distributed sensors in the internet-of-things (IoT) generate vast amounts of sparse data. Analyzing this high-dimensional data and identifying relevant predictors pose substantial challenges, especially when data is preferred to remain on the device where it was collected for reasons such as data integrity, communication bandwidth, and privacy. This paper introduces a federated quantile regression algorithm to address these challenges. Quantile regression provides a more comprehensive view of the relationship between variables than mean regression models. However, traditional approaches face difficulties when dealing with nonconvex sparse penalties and the inherent non-smoothness of the loss function. For this purpose, we propose a federated smoothing proximal gradient (FSPG) algorithm that integrates a smoothing mechanism with the proximal gradient framework, thereby enhancing both precision and computational speed. This integration adeptly handles optimization over a network of devices, each holding local data samples, making it particularly effective in federated learning scenarios. The FSPG algorithm ensures steady progress and reliable convergence in each iteration by maintaining or reducing the value of the objective function. By leveraging nonconvex penalties, such as the minimax concave penalty (MCP) and smoothly clipped absolute deviation (SCAD), the proposed method can identify and preserve key predictors within sparse models. Comprehensive simulations validate the robust theoretical foundations of the proposed algorithm and demonstrate improved estimation precision and reliable convergence.

Federated Smoothing Proximal Gradient for Quantile Regression with Non-Convex Penalties

TL;DR

Abstract

Paper Structure (10 sections, 8 theorems, 41 equations, 6 figures, 1 algorithm)

This paper contains 10 sections, 8 theorems, 41 equations, 6 figures, 1 algorithm.

Introduction
Preliminaries
Sparse Quantile Regression Framework
Smoothing Approximation
Federated Smoothing Proximal Gradient for Penalized Quantile Regression
Convergence Proof
Simulation Results
Simulation Setup
Results
Conclusion

Key Result

Lemma 1

The function $\Phi_\sigma\left(\mathbf{w},\mathbf{w}',\mu\right) = \sum_{l=1}^{L} \tilde{g}_l\mathopen{}\left(\mathbf{w}',\mu\right)\mathclose{} + n P_{\lambda,\gamma} (\mathbf{w})+\sigma\left\|\mathbf{w}-\mathbf{w}'\right\|_2^2$ is lower bounded.

Figures (6)

Figure 1: MSE versus iterations.
Figure 2: Accuracy of correctly recognizing active and non-active coefficients.
Figure 3: MSE versus the number of active coefficients $s$ in model parameter $\boldsymbol{\beta}_{\tau} \in \mathbb{R}^P$.
Figure 4: MSE versus iterations for FSPG, FHPG, and non-cooperation scenario.
Figure 5: MSE versus iterations for different $d$.
...and 1 more figures

Theorems & Definitions (19)

Lemma 1
proof
Lemma 2
proof
Lemma 3
proof
Theorem 1: Sufficient Descent Property
proof
Remark 1
Lemma 4
...and 9 more

Federated Smoothing Proximal Gradient for Quantile Regression with Non-Convex Penalties

TL;DR

Abstract

Federated Smoothing Proximal Gradient for Quantile Regression with Non-Convex Penalties

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (6)

Theorems & Definitions (19)