Testing Conditional Mean Independence Using Generative Neural Networks
Yi Zhang, Linjun Huang, Yun Yang, Xiaofeng Shao
TL;DR
This work develops a fully nonparametric conditional mean independence test for multivariate responses by introducing a population measure $\Gamma^*$ that characterizes $H_0: \mathbb{E}[Y|X,Z] = \mathbb{E}[Y|Z]$. The authors construct a sample statistic $\widehat{T}_n$ using cross-fitting, RKHS-based embeddings, and a conditional generator (GMMN) to obtain samples from $P_{X|Z}$, with a wild bootstrap to calibrate the null distribution. The method achieves double robustness to slow nonparametric estimation errors and demonstrates strong finite-sample performance in high-dimensional settings, including imaging data, while maintaining power against local alternatives at the parametric rate. The approach is validated through simulations and two imaging tasks (facial expression recognition and age estimation), illustrating practical applicability to high-dimensional covariates and multivariate outcomes. Overall, the paper offers a principled, scalable CMI testing framework that provides reliable size control and competitive power without restrictive parametric assumptions.
Abstract
Conditional mean independence (CMI) testing is crucial for statistical tasks including model determination and variable importance evaluation. In this work, we introduce a novel population CMI measure and a bootstrap-based testing procedure that utilizes deep generative neural networks to estimate the conditional mean functions involved in the population measure. The test statistic is thoughtfully constructed to ensure that even slowly decaying nonparametric estimation errors do not affect the asymptotic accuracy of the test. Our approach demonstrates strong empirical performance in scenarios with high-dimensional covariates and response variable, can handle multivariate responses, and maintains nontrivial power against local alternatives outside an $n^{-1/2}$ neighborhood of the null hypothesis. We also use numerical simulations and real-world imaging data applications to highlight the efficacy and versatility of our testing procedure.
