Spectral Regularized Kernel Goodness-of-Fit Tests
Omar Hagrass, Bharath K. Sriperumbudur, Bing Li
TL;DR
The paper addresses goodness-of-fit testing in non-Euclidean spaces via RKHS embeddings and shows the classical MMD-based test is not minimax optimal under general conditions. It introduces a spectral regularization framework, defining $oldsymbol{ exteta}_{oldsymbol{\lambda}}$ with a general regularizer $g_{oldsymbol{\lambda}}$, which subsumes Tikhonov regularization and relaxes the zero-mean kernel requirement. The authors develop an Oracle test, a computable two-sample statistic (when $P_0$ can be sampled), and two data-driven tests (SRCT and SRPT) with adaptation over regularization and kernels, proving minimax optimality (up to log factors) for broad classes of alternatives characterized by eigen-decay rates. Empirical results on periodic-spline, Gaussian, and directional data show that SRCT and SRPT outperform MMD, energy, KS, and SR2T, and adaptivity further improves performance. The work advances practical, minimax-optimal goodness-of-fit testing for non-Euclidean data and lays groundwork for kernel-regularization approaches in related hypothesis testing tasks.
Abstract
Maximum mean discrepancy (MMD) has enjoyed a lot of success in many machine learning and statistical applications, including non-parametric hypothesis testing, because of its ability to handle non-Euclidean data. Recently, it has been demonstrated in Balasubramanian et al.(2021) that the goodness-of-fit test based on MMD is not minimax optimal while a Tikhonov regularized version of it is, for an appropriate choice of the regularization parameter. However, the results in Balasubramanian et al. (2021) are obtained under the restrictive assumptions of the mean element being zero, and the uniform boundedness condition on the eigenfunctions of the integral operator. Moreover, the test proposed in Balasubramanian et al. (2021) is not practical as it is not computable for many kernels. In this paper, we address these shortcomings and extend the results to general spectral regularizers that include Tikhonov regularization.
