Universally Consistent K-Sample Tests via Dependence Measures

Sambit Panda; Cencheng Shen; Ronan Perry; Jelle Zorn; Antoine Lutz; Carey E. Priebe; Joshua T. Vogelstein

TL;DR

Independence tests are proved to achieve universally consistent K-sample testing, and K-sample statistics such as Energy and Maximum Mean Discrepancy (MMD) are shown to be exactly equivalent to distance correlation (Dcorr).

Abstract

The K-sample testing problem involves determining whether K groups of data points are all drawn from the same distribution. Analysis of variance is arguably the most classical method for testing differences in means, and several more recent methods test for differences in distribution. In this paper, we demonstrate the existence of a transformation that allows K-sample testing to be carried out using any dependence measure. Consequently, universally consistent K-sample testing can be achieved using any universally consistent dependence measure, such as distance correlation or the Hilbert-Schmidt independence criterion. This enables a wide range of dependence measures to be easily applied to K-sample testing.
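The reduction described in the abstract can be illustrated with a minimal sketch. It rests on assumptions that the text above does not spell out: the transformation is taken to be pooling the K samples and pairing each observation with a one-hot group label, the dependence measure is the biased sample distance correlation, and significance is assessed with a permutation test. All function names here (`k_sample_transform`, `centered_distances`, `mean_product`, `k_sample_dcorr_test`) are hypothetical and chosen for illustration only.

```python
import math
import random


def k_sample_transform(samples):
    """Pool K samples and pair each observation with a one-hot group label.

    Assumed form of the transformation; `samples` is a list of K groups,
    each a list of equal-length numeric tuples.
    """
    k = len(samples)
    pooled, labels = [], []
    for group, sample in enumerate(samples):
        for point in sample:
            pooled.append(tuple(point))
            labels.append(tuple(1.0 if j == group else 0.0 for j in range(k)))
    return pooled, labels


def centered_distances(points):
    """Double-centered Euclidean distance matrix of a list of points."""
    n = len(points)
    d = [[math.dist(p, q) for q in points] for p in points]
    row = [sum(r) / n for r in d]  # equals the column means by symmetry
    grand = sum(row) / n
    return [[d[i][j] - row[i] - row[j] + grand for j in range(n)]
            for i in range(n)]


def mean_product(a, b, order):
    """Mean of a[i][j] * b[order[i]][order[j]] over all index pairs."""
    n = len(a)
    total = 0.0
    for i in range(n):
        ai, bi = a[i], b[order[i]]
        total += sum(ai[j] * bi[order[j]] for j in range(n))
    return total / (n * n)


def k_sample_dcorr_test(samples, reps=200, seed=0):
    """Return (biased squared distance correlation, permutation p-value)."""
    rng = random.Random(seed)
    pooled, labels = k_sample_transform(samples)
    a = centered_distances(pooled)
    b = centered_distances(labels)
    identity = list(range(len(pooled)))
    # Distance variances do not change when labels are permuted,
    # so the normalising constant is computed once.
    scale = math.sqrt(mean_product(a, a, identity) * mean_product(b, b, identity))
    if scale == 0.0:  # degenerate input: constant data or a single group
        return 0.0, 1.0
    stat = mean_product(a, b, identity) / scale
    exceed = 0
    for _ in range(reps):
        order = identity[:]
        rng.shuffle(order)  # shuffle group labels relative to the data
        if mean_product(a, b, order) / scale >= stat:
            exceed += 1
    return stat, (1 + exceed) / (1 + reps)
```

Calling `k_sample_dcorr_test([group_a, group_b, group_c])` returns the statistic and a p-value; a small p-value suggests that at least one group differs in distribution. Any other dependence measure could be substituted for the distance correlation step without changing the transformation.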
