Differentially private Bayesian tests
Abhisek Chakraborty, Saptati Datta
TL;DR
This paper integrates differential privacy with Bayesian hypothesis testing by formulating DP Bayes factors based on test statistics within a principled data-generative framework. It uses a hierarchical, partitioned model with mixture priors to keep Bayes factors bounded under privacy constraints, enabling a Laplace-based privacy mechanism and a data-driven cut-off to preserve predefined error rates. The authors derive closed-form expressions and consistency results for Bayes factors under z, t, χ^2, and F tests, and provide practical algorithms for hyperparameter tuning and power optimization. Through simulations and a DAIC-WOZ gender-d difference case study, the approach demonstrates interpretable Bayesian evidence under privacy budgets, offering a scalable alternative to differentially private frequentist testing with formal privacy guarantees.
Abstract
Differential privacy has emerged as an significant cornerstone in the realm of scientific hypothesis testing utilizing confidential data. In reporting scientific discoveries, Bayesian tests are widely adopted since they effectively circumnavigate the key criticisms of P-values, namely, lack of interpretability and inability to quantify evidence in support of the competing hypotheses. We present a novel differentially private Bayesian hypotheses testing framework that arise naturally under a principled data generative mechanism, inherently maintaining the interpretability of the resulting inferences. Furthermore, by focusing on differentially private Bayes factors based on widely used test statistics, we circumvent the need to model the complete data generative mechanism and ensure substantial computational benefits. We also provide a set of sufficient conditions to establish results on Bayes factor consistency under the proposed framework. The utility of the devised technology is showcased via several numerical experiments.
