Differentially Private Conditional Independence Testing
Iden Kalemaj, Shiva Prasad Kasiviswanathan, Aaditya Ramdas
TL;DR
This work addresses conditional independence testing under differential privacy, focusing on continuous conditioning variables $Z$. It introduces two DP CI tests, PrivGCM and PrivCRT, each built on solid non-private counterparts and accompanied by rigorous type-I error control and power guarantees. The analysis shows that privacy noise can, in some regimes, improve finite-sample type-I error behavior, while larger sample sizes mitigate power loss, with PrivCRT often delivering stronger power under model-X assumptions. The proposed methods are validated empirically on synthetic and real data, demonstrating robust privacy-preserving CI testing with practical applicability to sensitive domains like genomics and clinical data.
Abstract
Conditional independence (CI) tests are widely used in statistical data analysis, e.g., they are the building block of many algorithms for causal graph discovery. The goal of a CI test is to accept or reject the null hypothesis that $X \perp \!\!\! \perp Y \mid Z$, where $X \in \mathbb{R}, Y \in \mathbb{R}, Z \in \mathbb{R}^d$. In this work, we investigate conditional independence testing under the constraint of differential privacy. We design two private CI testing procedures: one based on the generalized covariance measure of Shah and Peters (2020) and another based on the conditional randomization test of Candès et al. (2016) (under the model-X assumption). We provide theoretical guarantees on the performance of our tests and validate them empirically. These are the first private CI tests with rigorous theoretical guarantees that work for the general case when $Z$ is continuous.
