Federated Nonparametric Hypothesis Testing with Differential Privacy Constraints: Optimal Rates and Adaptive Tests
T. Tony Cai, Abhinav Chakraborty, Lasse Vuursteen
TL;DR
The paper tackles federated nonparametric goodness-of-fit testing under distributed $(\epsilon,\delta)$-DP, focusing on the white-noise-with-drift model and Besov smoothness classes. It develops minimax lower and upper bounds for the separation rate $\rho$, revealing multiple phase transitions and showing that access to a shared randomness source can improve testing rates. It then constructs three DP-compliant testing procedures that are optimal (up to logarithmic factors) in different privacy-budget regimes, and extends them to adaptive, data-driven tests when the regularity parameter $s$ is unknown. The results quantify the privacy-cost of distributed testing and provide adaptive strategies with minimal DP overhead, offering guidance for designing privacy-preserving federated hypothesis tests in high-dimensional nonparametric settings.
Abstract
Federated learning has attracted significant recent attention due to its applicability across a wide range of settings where data is collected and analyzed across disparate locations. In this paper, we study federated nonparametric goodness-of-fit testing in the white-noise-with-drift model under distributed differential privacy (DP) constraints. We first establish matching lower and upper bounds, up to a logarithmic factor, on the minimax separation rate. This optimal rate serves as a benchmark for the difficulty of the testing problem, factoring in model characteristics such as the number of observations, noise level, and regularity of the signal class, along with the strictness of the $(ε,δ)$-DP requirement. The results demonstrate interesting and novel phase transition phenomena. Furthermore, the results reveal an interesting phenomenon that distributed one-shot protocols with access to shared randomness outperform those without access to shared randomness. We also construct a data-driven testing procedure that possesses the ability to adapt to an unknown regularity parameter over a large collection of function classes with minimal additional cost, all while maintaining adherence to the same set of DP constraints.
