Extending Characterizations of Multivariate Laws via Distance Distributions
Annika Betken, Aljosa Marjanovic, Katharina Proksch
TL;DR
This work broadens the identifiability framework for multivariate distributions by replacing the rigid homogeneity assumption with analytic conditions on distance-induced balls, namely volume-regularity, Lebesgue differentiability with uniformly bounded centered oscillations, and controlled density oscillations. It proves a generalized identifiability theorem: if i.i.d. samples X_i ~ f and Y_j ~ g have densities with the stated regularity, then equality of within-sample and between-sample interpoint distance distributions forces f and g to coincide, with corollaries recovering Maa's original theorem via monotone transformations and extending to compact Riemannian manifolds. The paper also provides quantitative L^2 bounds linking the density difference to Kolmogorov discrepancies of distance distributions, yielding dimension-aware rates under Ahlfors alpha-regularity and Hölder continuity of the densities. Through concrete examples—including Canberra, entropic, and Bray–Curtis distances—it demonstrates both the applicability and boundaries of the theory, offering a principled foundation for distance-based two-sample testing beyond translation-invariant geometries.
Abstract
We extend a theorem of Maa, Pearl, and Bartoszynski, which links equality of interpoint distance distributions to equality of underlying multivariate distributions, beyond the restrictive class of homogeneous, translation-invariant distance functions. Our approach replaces geometric assumptions on the distance with analytic conditions: volume-regularity of distance-induced balls, Lebesgue differentiability with respect to the distance, and bounded centered oscillations of densities. Under these conditions, equality of interpoint distance distributions continues to imply equality of the generating laws. The result persists under monotone continuous transformations of homogeneous, translation-invariant distances, recovering the original statement, and it extends to compact Riemannian manifolds equipped with the geodesic metric. We further develop a quantitative version of the theorem, i.e., inequalities that connect discrepancies of interpoint distance distributions to the $L^2$-distance between densities, and obtain explicit rates under Ahlfors $α$-regularity of the distance function and $β$-Hölder continuity of densities, capturing dependence on dimensionality. Several representative examples illustrate the applicability of the generalization to domain-specific distances used in modern statistics. The examples include non-homogeneous non-translation invariant distances such as Canberra, entropic distances, and the Bray--Curtis dissimilarity.
