Sliced Wasserstein with Random-Path Projecting Directions
Khai Nguyen, Shujian Zhang, Tam Le, Nhat Ho
TL;DR
The paper tackles the inefficiency of selecting slicing directions in Sliced Wasserstein distances by introducing Random-Path Projecting Direction (RPD) and Random-Path Slicing Distribution (RPSD), enabling optimization-free sampling. It defines two SW variants, Random-Path Projection Sliced Wasserstein (RPSW) and Importance Weighted RPSW (IWRPSW), and establishes their theoretical properties, including topological and statistical aspects and favorable computational complexity. The authors demonstrate that RPSW and IWRPSW outperform traditional SW variants in gradient-flow tasks and in training denoising diffusion models, achieving faster convergence and improved generation quality with reduced compute. This approach offers a scalable, discriminative, and practical alternative to optimization-based slicing schemes, with potential extensions to manifolds and a wide range of downstream applications in generative modeling and beyond.
Abstract
Slicing distribution selection has been used as an effective technique to improve the performance of parameter estimators based on minimizing sliced Wasserstein distance in applications. Previous works either utilize expensive optimization to select the slicing distribution or use slicing distributions that require expensive sampling methods. In this work, we propose an optimization-free slicing distribution that provides a fast sampling for the Monte Carlo estimation of expectation. In particular, we introduce the random-path projecting direction (RPD) which is constructed by leveraging the normalized difference between two random vectors following the two input measures. From the RPD, we derive the random-path slicing distribution (RPSD) and two variants of sliced Wasserstein, i.e., the Random-Path Projection Sliced Wasserstein (RPSW) and the Importance Weighted Random-Path Projection Sliced Wasserstein (IWRPSW). We then discuss the topological, statistical, and computational properties of RPSW and IWRPSW. Finally, we showcase the favorable performance of RPSW and IWRPSW in gradient flow and the training of denoising diffusion generative models on images.
