Stochastic gradient descent for streaming linear and rectified linear systems with adversarial corruptions
Halyun Jeong, Deanna Needell, Elizaveta Rebrova
TL;DR
This work addresses robust streaming regression for both linear and ReLU models under Massart semi-random corruptions. It introduces SGD-exp, an SGD variant with exponential decay step sizes, and proves nearly linear convergence: $\|\mathbf{x}-\mathbf{x}_T\|_2 \lesssim \log T \exp\left(-\frac{T}{d \log^2 T}\right)$ with high probability for corruption probability $p<\tfrac{1}{2}$, and extending to symmetric oblivious corruptions up to $p<1$. The analysis hinges on drift arguments for a transformed residual process, plus a drift-based MGF bound, delivering the first convergence guarantees for streaming robust ReLU regression under Massart noise. Experiments on synthetic and real datasets corroborate the theory, showing fast, robust recovery under various corruptions and demonstrating practical viability. The approach offers a practical and theoretically-grounded tool for robust streaming regression with broad implications for real-time learning under adversarial or semi-random data corruptions.
Abstract
We propose SGD-exp, a stochastic gradient descent approach for linear and ReLU regressions under Massart noise (adversarial semi-random corruption model) for the fully streaming setting. We show novel nearly linear convergence guarantees of SGD-exp to the true parameter with up to $50\%$ Massart corruption rate, and with any corruption rate in the case of symmetric oblivious corruptions. This is the first convergence guarantee result for robust ReLU regression in the streaming setting, and it shows the improved convergence rate over previous robust methods for $L_1$ linear regression due to a choice of an exponentially decaying step size, known for its efficiency in practice. Our analysis is based on the drift analysis of a discrete stochastic process, which could also be interesting on its own.
