PRIMO: Private Regression in Multiple Outcomes
Seth Neel
TL;DR
PRIMO introduces private regression in multiple outcomes, formalizing the joint objective $\min_W \|XW-Y\|_F^2$ under differential privacy. It develops two algorithmic families: ReuseCovGauss (Full DP) and ReuseCovProj (projection-based), achieving favorable privacy-utility tradeoffs across regimes and, in some cases, eliminating explicit dependence on the number of outcomes $l$ for large $l$. The methods exploit shared covariances across regressions by reusing noisy $X^{T}X$ and, in projection-based variants, privately releasing $X^{T}Y$ with improved $l$-scaling under feature or label DP. Empirical results on genomic datasets (1KG, dbGaP) show projection-based PRIMO can outperform naive baselines and remains effective for very large numbers of outcomes, highlighting practical impact for multi-phenotype genomic risk prediction under privacy constraints.
Abstract
We introduce a new private regression setting we call Private Regression in Multiple Outcomes (PRIMO), inspired by the common situation where a data analyst wants to perform a set of $l$ regressions while preserving privacy, where the features $X$ are shared across all $l$ regressions, and each regression $i \in [l]$ has a different vector of outcomes $y_i$. Naively applying existing private linear regression techniques $l$ times leads to a $\sqrt{l}$ multiplicative increase in error over the standard linear regression setting. We apply a variety of techniques including sufficient statistics perturbation (SSP) and geometric projection-based methods to develop scalable algorithms that outperform this baseline across a range of parameter regimes. In particular, we obtain no dependence on l in the asymptotic error when $l$ is sufficiently large. Empirically, on the task of genomic risk prediction with multiple phenotypes we find that even for values of $l$ far smaller than the theory would predict, our projection-based method improves the accuracy relative to the variant that doesn't use the projection.
