Anchor-based Maximum Discrepancy for Relative Similarity Testing
Zhijian Zhou, Liuhua Peng, Xunye Tian, Feng Liu
TL;DR
The paper introduces Anchor-based Maximum Discrepancy (AMD), a kernel-based framework for relative similarity testing among three distributions: an anchor $\mathbb{U}$ and two candidates $\mathbb{P}$ and $\mathbb{Q}$. AMD defines relative similarity as the maximum discrepancy between the distances of $(\mathbb{U},\mathbb{P})$ and $(\mathbb{U},\mathbb{Q})$ over a space of deep kernels. Phase I estimates AMD, learning the kernel and the hypothesis together; Phase II applies a unified AMD test that uses wild bootstrap to assess the significance of the inferred hypothesis. The method comes with theoretical guarantees, including consistency of the AMD estimator and asymptotic power advantages when the learned direction aligns with the true relative similarity. It is validated on benchmarks (MNIST, CIFAR-10) and on practical tasks such as relative model evaluation and adversarial perturbation detection. The authors report improved test power, release open-source code, discuss limitations such as potential overfitting and runtime, and outline directions for extending relative similarity testing to more than three distributions.
Abstract
Relative similarity testing aims to determine which of two distributions, P or Q, is closer to an anchor distribution U. Existing kernel-based approaches often test relative similarity with a fixed kernel under a manually specified alternative hypothesis, e.g., Q is closer to U than P. Although kernel selection is known to be important for kernel-based testing methods, a manually specified hypothesis poses a significant challenge for kernel selection in relative similarity testing: once the hypothesis is specified, we can always find a kernel such that the hypothesis is rejected. This makes relative similarity testing ill-defined when we want to select a good kernel after the hypothesis is specified. In this paper, we address this challenge by learning a proper hypothesis and a kernel simultaneously, instead of learning a kernel after manually specifying the hypothesis. We propose the anchor-based maximum discrepancy (AMD), which defines relative similarity as the maximum discrepancy between the distances of (U, P) and (U, Q) in a space of deep kernels. Based on AMD, our test proceeds in two phases. In Phase I, we estimate the AMD over the deep kernel space and infer the potential hypothesis. In Phase II, we assess the statistical significance of the potential hypothesis, for which we propose a unified testing framework that derives thresholds for tests over the different possible hypotheses from Phase I. Lastly, we validate our method theoretically and demonstrate its effectiveness via extensive experiments on benchmark datasets. Code is publicly available at: https://github.com/zhijianzhouml/AMD.
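The Phase I idea described above can be illustrated with a minimal sketch. It is not the authors' implementation, and it makes several assumptions: the distance between two distributions is taken to be the unbiased squared MMD estimate, a small grid of Gaussian bandwidths stands in for the space of deep kernels, and a positive gap is read as "Q closer to U". The function names (gaussian_gram, mmd2_unbiased, phase_one_sketch) are hypothetical. No significance is assessed here; that is the role of Phase II.

```python
import numpy as np


def gaussian_gram(x, y, bandwidth):
    """Gaussian kernel matrix between the rows of x and the rows of y."""
    sq_dists = (
        np.sum(x**2, axis=1)[:, None]
        + np.sum(y**2, axis=1)[None, :]
        - 2.0 * x @ y.T
    )
    # Clip tiny negative values caused by floating-point round-off.
    return np.exp(-np.maximum(sq_dists, 0.0) / (2.0 * bandwidth**2))


def mmd2_unbiased(x, y, bandwidth):
    """Unbiased estimate of the squared MMD between samples x and y."""
    n, m = len(x), len(y)
    kxx = gaussian_gram(x, x, bandwidth)
    kyy = gaussian_gram(y, y, bandwidth)
    kxy = gaussian_gram(x, y, bandwidth)
    # Exclude diagonal terms from the within-sample averages.
    term_xx = (kxx.sum() - np.trace(kxx)) / (n * (n - 1))
    term_yy = (kyy.sum() - np.trace(kyy)) / (m * (m - 1))
    return term_xx + term_yy - 2.0 * kxy.mean()


def phase_one_sketch(u, p, q, bandwidths):
    """Pick the kernel with the largest |d(U, P) - d(U, Q)| and read off a direction.

    Assumption: the bandwidth grid is a stand-in for the deep kernel space,
    and the sign convention below is chosen for illustration only.
    """
    best_bandwidth, best_gap = None, 0.0
    for bandwidth in bandwidths:
        gap = mmd2_unbiased(u, p, bandwidth) - mmd2_unbiased(u, q, bandwidth)
        if best_bandwidth is None or abs(gap) > abs(best_gap):
            best_bandwidth, best_gap = bandwidth, gap
    # gap > 0 means U is estimated to be farther from P than from Q.
    direction = "Q closer to U" if best_gap > 0 else "P closer to U"
    return {"bandwidth": best_bandwidth, "gap": best_gap, "direction": direction}


# Toy usage: P is shifted far from the anchor U, Q only slightly.
rng = np.random.default_rng(0)
u = rng.normal(0.0, 1.0, size=(200, 2))
p = rng.normal(3.0, 1.0, size=(200, 2))
q = rng.normal(0.1, 1.0, size=(200, 2))
result = phase_one_sketch(u, p, q, bandwidths=[0.5, 1.0, 2.0, 4.0])
print(result["direction"], result["bandwidth"])
```

The point of the sketch is that the direction of the hypothesis is an output of the kernel search rather than an input to it, which is how the abstract proposes to avoid choosing a kernel after the hypothesis has already been fixed.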
