Beyond Covariance Matrix: The Statistical Complexity of Private Linear Regression
Fan Chen, Jiachun Li, Alexander Rakhlin, David Simchi-Levi
TL;DR
This work develops a minimax theory for private linear regression under general covariate distributions and reveals that privacy complexity is governed by an $L_1$-analogue of the Fisher information, not the usual covariance. It introduces Information-Weighted Regression, computable in both Local and Global DP, and proves $L_1$ convergence and distribution-specific minimax optimality. The framework extends to dimension-free settings with near-minimax performance, and is applied to private linear contextual bandits to achieve rate-optimal regret under both joint and local privacy, addressing open questions about privacy-utility trade-offs. Overall, the approach unifies privacy-aware complexity via the matrices $oldsymbol{U}_{oldsymbol{ ext{λ}}}$ and $oldsymbol{W}_{oldsymbol{ ext{γ,λ}}}$, yielding practical, near-optimal private-learning and decision-making tools across DP settings.
Abstract
We study the statistical complexity of private linear regression under an unknown, potentially ill-conditioned covariate distribution. Somewhat surprisingly, under privacy constraints the intrinsic complexity is \emph{not} captured by the usual covariance matrix but rather its $L_1$ analogues. Building on this insight, we establish minimax convergence rates for both the central and local privacy models and introduce an Information-Weighted Regression method that attains the optimal rates. As application, in private linear contextual bandits, we propose an efficient algorithm that achieves rate-optimal regret bounds of order $\sqrt{T}+\frac{1}α$ and $\sqrt{T}/α$ under joint and local $α$-privacy models, respectively. Notably, our results demonstrate that joint privacy comes at almost no additional cost, addressing the open problems posed by Azize and Basu (2024).
