List-Decodable Regression via Expander Sketching
Herbod Pourali, Sajjad Hashemian, Ebrahim Ardeshir-Larijani
TL;DR
This paper tackles robust list-decodable linear regression when a large fraction of samples may be adversarially corrupted. It introduces an expander-sketching pipeline that synthesizes lightly contaminated buckets via lossless expanders, followed by robust aggregation and spectral filtering to recover the regression direction. The method achieves near-optimal rates with sample complexity tilde-O((d+log(1/delta))/alpha), list size O(1/alpha), and near input-sparsity runtime tilde-O(nnz(X)+d^3/alpha), without relying on explicit batch structure or SoS techniques. Theoretical analysis ties isolation properties of expanders to moment concentration and perturbation bounds, and experiments demonstrate strong robustness across varying contamination and a real-data stress test. Overall, the work shows how combinatorial sketching can bypass SQ barriers and enable efficient, resistant learning in adversarial settings with practical impact.
Abstract
We introduce an expander-sketching framework for list-decodable linear regression that achieves sample complexity $\tilde{O}((d+\log(1/δ))/α)$, list size $O(1/α)$, and near input-sparsity running time $\tilde{O}(\mathrm{nnz}(X)+d^{3}/α)$ under standard sub-Gaussian assumptions. Our method uses lossless expanders to synthesize lightly contaminated batches, enabling robust aggregation and a short spectral filtering stage that matches the best known efficient guarantees while avoiding SoS machinery and explicit batch structure.
