Minimax-Optimal Two-Sample Test with Sliced Wasserstein

Binh Thuan Tran; Nicolas Schreuder

TL;DR

This work develops a new nonparametric two-sample test based on the sliced Wasserstein distance. Built on a permutation framework, the test achieves finite-sample Type I error control and comes with non-asymptotic power guarantees, showing minimax optimality with separation rate $n^{-1/2}$ over multinomial and bounded-support alternatives. The method uses random projections to reduce dimensionality, and the analysis quantifies the trade-off between the number of projections and statistical power; scalability is maintained through efficient computation and parallelism. Empirical results on synthetic data and MNIST demonstrate robust performance without kernel tuning, highlighting the method’s practical utility for geometry-aware distribution testing in high dimensions.

Abstract

We study the problem of nonparametric two-sample testing using the sliced Wasserstein (SW) distance. While prior theoretical and empirical work indicates that the SW distance offers a promising balance between strong statistical guarantees and computational efficiency, its theoretical foundations for hypothesis testing remain limited. We address this gap by proposing a permutation-based SW test and analyzing its performance. The test inherits finite-sample Type I error control from the permutation principle. Moreover, we establish non-asymptotic power bounds and show that the procedure achieves the minimax separation rate $n^{-1/2}$ over multinomial and bounded-support alternatives, matching the optimal guarantees of kernel-based tests while building on the geometric foundations of Wasserstein distances. Our analysis further quantifies the trade-off between the number of projections and statistical power. Finally, numerical experiments demonstrate that the test combines finite-sample validity with competitive power and scalability. Unlike kernel-based tests, which require careful kernel tuning, it performs consistently well across all scenarios we consider.
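To make the idea of a permutation-based SW test concrete, the following is a minimal sketch, not the authors' implementation. It assumes equal sample sizes, a Monte Carlo estimate of SW with uniformly random unit directions, and the standard permutation p-value $(1 + \#\{\text{permuted} \ge \text{observed}\})/(1 + B)$. The function names `sliced_wasserstein` and `sw_permutation_test`, the default parameters, and the choice to reuse one set of directions across all permutations are illustrative assumptions.

```python
import numpy as np


def sliced_wasserstein(x, y, directions, p=2):
    """Monte Carlo estimate of SW_p between two equal-size samples.

    x, y: arrays of shape (n, d); directions: array of shape (L, d) of unit vectors.
    """
    # Project both samples onto every direction and sort each 1-D projection.
    px = np.sort(x @ directions.T, axis=0)
    py = np.sort(y @ directions.T, axis=0)
    # For equal-size empirical measures in 1-D, W_p^p is the mean of
    # |difference of order statistics|^p; averaging over columns averages
    # over projections as well.
    return np.mean(np.abs(px - py) ** p) ** (1.0 / p)


def sw_permutation_test(x, y, n_projections=50, n_permutations=200, p=2, seed=0):
    """Permutation two-sample test using the estimate above (hypothetical helper)."""
    rng = np.random.default_rng(seed)
    n, d = x.shape
    # Assumption: directions are drawn once, uniformly on the unit sphere,
    # and shared by the observed and all permuted statistics.
    directions = rng.standard_normal((n_projections, d))
    directions /= np.linalg.norm(directions, axis=1, keepdims=True)

    observed = sliced_wasserstein(x, y, directions, p)
    pooled = np.vstack([x, y])
    count = 0
    for _ in range(n_permutations):
        perm = rng.permutation(len(pooled))
        stat = sliced_wasserstein(pooled[perm[:n]], pooled[perm[n:]], directions, p)
        count += int(stat >= observed)
    # The +1 terms count the observed statistic among the permutations,
    # which is what gives finite-sample Type I error control.
    p_value = (1 + count) / (1 + n_permutations)
    return observed, p_value


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    x = rng.standard_normal((100, 5))
    y = rng.standard_normal((100, 5)) + 0.5  # mean-shifted alternative
    stat, p_value = sw_permutation_test(x, y)
    print(f"SW statistic: {stat:.3f}, permutation p-value: {p_value:.4f}")
```

Rejecting when the p-value is at most the level $\alpha$ gives a level-$\alpha$ test under exchangeability of the pooled sample. In this sketch, increasing `n_projections` reduces the Monte Carlo error of the SW estimate at a proportional cost in compute, which is the kind of trade-off the abstract refers to.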
